The results of sequencing for the nucleotide sequence for the fusion protein 
STH. The sequence for gene of interest is located fronn nucleotide 41 to 
nucleotide 664; and the sequence for linker peptide is located from nucleotide 
449 to nucleotide 466. 




The nucleotide sequence for the fusion protein STH, wherein the nucleotide 
sequence for the linker peptide Is underlined 



TCAAGTTCATTCGACAAAGGAAAATATAGAAAAGGCGATGACGCGAGTTAT 

TTTGAACCAACAGGCCCGTATTTGATGGTAAATGTGACTGGAGTTGATGG 

TAAAGGAAATGAATTGCTATCCCCTCGTTATGTCGAGTTTCCTATTAAACCT 

GGGACTACACTTACAAAAGAAAAAATTGAATACTATGTCGAATGGGCATTA 

GATGCGACAGCATATAAAGAGTTTAGAGTAGTTGAATTAGATCCAAGCGCA 

AAGATCGAAGTCACTTATTATGATAAGAATAAGAAAAAAGAAGAAACGAAG 

TCTTTCCCTATAACAGAAAAAGGTTTTGTTGTCCCAGATTTATCAGAGCATA 

TTAAAAACCCTGGATTCAACTTAATTACAAAGGTTGTTATAGAAAAGAAAG 

GATCCCTGGGTCCGCGTA TTACTTACACTGATTGTACAGAATCGGGTCAA 

AATTTGTGCCTCTGCGAGGGAAGCAATGTTTGCGGTAAAGGCAATAAGTG 

CATATTGGGTTCTAATGGAAAGGGCAACCAATGTGTCACTGGCGAAGGTA 

CACCGAAGCCTGAAAGCCATAACAACGGCGATTTCGAAGAAATTCCAGAA 

GAATATTTACAATAA 
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Abstract 

Expression and purification of proteins in recombinant DNA systems is a powerful and widely used technique. Frequently there 
is the need to express the protein of interest as a fusion protein or chimeric protein. Fusion protein technology is frequently used to 
attach a "signal" which can be used for subsequent localization of the protein or a "carrier" which can be used to deliver a 
"therapeutic" such as a radioactive molecule to a specific site. In addition to these applications, fusion protein technology can be 
employed for several other useful purposes. Of these, the most frequent reason is to provide a *tag' or *handle' which will aid in the 
purification of the protein. Another useful purpose is to improve the expression or folding of the protein of interest. In these latter 
two situations, it is often necessary to remove the fusion partner before the recombinant protein of interest can be used for further 
studies. This removal process involves the insertion of a unique amino acid sequence that is susceptible to cleavage by a highly 
specific protease. Thrombin and factor Xa are the most frequently used proteases for this application. The purpose of this review is 
to discuss the application of thrombin and factor Xa for the cleavage of fusion proteins. It is emphasized that while these enzymes 
are quite specific for cleavage at the inserted cleavage site, proteolysis can frequently occur at other site(s) in the protein of interest. It 
is necessary to characterize the protein of interest after cleavage from the afllinity label to assure that there are no changes in the 
covalent structure of the protein of interest. Examples are presented which describe the proteolysis of the protein of interest by either 
factor Xa or thrombin. 

© 2003 Elsevier Science (USA). All rights reserved. 



The use of recombinant DNA technology for the 
expression of proteins in both prokaryotic and eukary- 
otic cells is a well-accepted technology. Often such 
proteins are expressed as fusion or chimeric proteins. 
The use of a fusion partner as carrier for either a "sig- 
nal" or "pharmaceuticar' is of significant value [1-6]. A 
more frequent application is the use of a fusion partner 
to provide a "tag" which can be used for the subsequent 
purification. Examples of such "tags" are the hexahis- 
tidine [7], streptavidin [8], glutathione-5-transferase [9], 
and maltose-binding protein [10]. These *tags* are par- 
ticularly useful because for each there is a corresponding 
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affinity matrix that when used under appropriate con- 
ditions will bind and release the *tag.' Once the chimeric 
molecule has been isolated, it is usually necessary to 
remove the "tag" before subsequent use of the expressed 
protein of interest [11]. The most popular method to 
remove the "tag" involves the use of a specific protease 
such as thrombin or factor Xa. This involves the inser- 
tion of a unique amino acid sequence that is specific for 
cleavage by either factor Xa or thrombin between the 
protein of interest and the fusion partner. In the current 
article, we review the use of this technology. 

Both thrombin and factor Xa are trypsin-like serine 
proteases which will cleave peptide bonds on the car- 
boxyl side (P',) of a basic amino acid residue. While 
trypsin will effectively cleave peptide bonds when either 
lysine or arginine precedes the carboxyl group, thrombin 
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and factor Xa have a preference for peptide bonds in 
which the carboxyl function is provided by an argininyl 
residue. In addition, while trypsin will cleave essentially 
all peptide bonds in a protein where the carboxyl moiety 
is contributed by lysine or arginine, thrombin and factor 
Xa are far more specific and cleave only certain peptide 
bonds. This specificity is contributed by the sequence of 
amino acids surrounding the scissile peptide and more 
particularly that sequence which precedes the scissile 
bond. The differences between trypsin and thrombin/ 
factor Xa reflect the fact that trypsin is a "digestive" 
protease while thrombin and factor Xa are regulatory 
proteases. 

The specificity of thrombin has been studied by a 
number of investigators [12]. The most common linker 
sequence is LVPRG or LVPRGS, which is derived from 
the sequence in bovine factor XIII that is cleaved by 
thrombin during the activation process [13]. Cleavage 
occurs at the arginine residue, resulting in the protein of 
interest being extended at its amino-terminal end by 
either a G or GS. The linker sequence for factor Xa 
cleavage is usually lEGR which is derived from the se- 
quences in prothrombin which are hydrolyzed by factor 
Xa during the conversion of prothrombin to thrombin 
[14]. While cleavage by thrombin or factor Xa in the 
linker sequence region is reasonably specific, it is not 
absolute. Cleavage can occur at other sequence sites in 
the protein. There have been a limited number of studies 
on the action of thrombin on proteins other than those 
involved in blood coagulation. Lonsdale-Eccles et al. 
[15] studied the hydrolysis of chemically modified pro- 
teins by thrombin and factor Xa. Both enzymes hydro- 
lyze peptide bonds in which the carboxyl moiety is 
contributed by arginine or lysine and hydrolysis at 
peptide bonds containing 5-(P-aminoethyl)cysteine is 
also observed. In performic acid-oxidized bovine chy- 
motrypsinogen, bovine thrombin and factor Xa cleaved 
at the argininyl residue in the sequence GVYARVTA. 
Factor Xa also cleaved the following sequence 
SGLSRIVN which was not cleaved by thrombin. Bovine 
thrombin cleaved at the arginine residues in the se- 
quence QVRLG in maleated performic acid-oxidized 
bovine trypsinogen; this sequence was not cleaved by 
factor Xa. Factor Xa did cleave at the arginine residue 
in the sequence NSRVA in maleated, performic acid- 
oxidized bovine trypsinogen; this sequence was also 
hydrolyzed by thrombin. Thrombin cleaved at the lysine 
residues in the sequences MKSRNL, RCKPVN, and 
SSKYPN in performic acid-oxidized ribonuclease. Fac- 
tor Xa did not hydrolyze any peptide bonds in performic 
acid-oxidized ribonuclease. The enzymatic hydrolyses 
were performed in either 0.2 M NH4CO3, pH 8.0, or in a 
pH-stat. The bicarbonate-buffered digest contained the 
substrate protein at a concentration of 5.0mg/mL and 
either factor Xa (200|ig/mL) or thrombin (50|ig/mL). 
The digest was performed for 8-9 h. 



Chang [16] studied the action of thrombin on 30 
polypeptide hormones. The sequences cleaved are 
shown in Table 1 . There are significant differences in the 
rate of hydrolysis of these bonds. With secretin, the time 
required to obtain 50% cleavage of the Arg-Asp is 
11.5h. After 48 h of hydrolysis, there was limited 
cleavage of the Arg-Leu peptide bond. It is interesting 
to note that heparin increased the rate of hydrolysis of 
the Arg-Asp peptide bond. There was approximately 
35% cleavage of the two scissle peptide bonds in vaso- 
active intestine protein after 48 h. There was 50% 
cleavage of the Arg-Val in choleocystokinin after 75 min 
and 23% cleavage of the Lys-Asparagine bond at 24 h. 
There was 50% cleavage of the Lys-Leu bond at 5 h. 

Chang et al. [17] studied the hydrolysis of antibody 
light chains by thrombin. Mouse k light chains 10K26 
and 10K44 from anti-(azobenzene arsonate) antibodies 
contain 20 Arg/Lys-X peptide bonds. Only two are 
cleaved by thrombin; NTLPRTFGG has a 50% cleavage 
time of 10 min with 0.5NIH units thrombin and 1 nmol 
protein in 20 ^L of 50 mM NH4CO3, pH 8.0, at 25 °C. 
There is cleavage of the sequence SPIVKSFN, which is 
hydrolyzed at the lysine residue with a 50% time of lOh. 

The purpose of reviewing the above information was 
to demonstrate that both thrombin and factor Xa can 
hydrolyze a variety of peptide bonds in proteins. As 
such, characterization of the desired product protein 
after cleavage of the fusion protein is essential. Reten- 
tion of biological activity does not necessarily imply 
retention of native covalent structure. The following 
studies are discussed as examples of "extraneous cleav- 
age" by thrombin or factor Xa in fusion proteins. 

Raftery et al. [18] designed a fusion protein between 
murine SlOO protein MRP14 and glutathione-S'-trans- 
ferase. The thrombin cleavage sequence LVPRGS was 
placed between the two proteins. The fusion protein was 
expressed in Escherichia coli and was isolated from the 



Table 1 



Sequences cleaved by thrombin in polypeptide hormones 



Polypeptide homones^ 


Sequence cleaved 


Secretin 


ELSLSRLRDSA 


Secretin 


ELSLSRLR (much 




slower than above) 


Vasoactive intestine polypeptide 


DNYTRLRK 


Vasoactive intestine polypeptide 


YTRLRKQM 


Choleocystokinin 


APSGRVSM 


Choleocystokinin 


VSMIKNLQ 


Dynorphin A 


RIRPKLKW 


Somatostatin-28 


AMAPRERK 


Somatostatin-28 


NFFWKTFT 


Gastrin releasing peptide 


KMYPRGNH 


Salmon calcitonin 


QTYPRTNT 



^Thc reaction mixtures contained 0.5 NIH units thrombin and 
1.0 nmol peptide in 20 pL of 50 mM NH4CO3, pH 8.0, at 25 °C. The 
conditions were designed to obtain an enzyme/substrate ratio of 1:60 
(w/w). 
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culture media using glutathione-agarose beads. The 
isolated fusion protein was digested with thrombin 
(20NIH units/mL in 25 mM Tris and 250 mM NaCl, 
pH 7.5, for 60min at 37 ^C). HPLC analysis of the 
products revealed a component with a molecular mass 
of 13,062 Da consistent with MRP 14 with a GS exten- 
sion at the amino terminal end. More extensive digestion 
(3 h) resulted in two components, one with a molecular 
mass of 13,062 Da and consistent with native MRP 14 
with GS extension at the NH2-terminal end and second 
component with a molecular mass of 11,919 Da. Pro- 
longed digestion (24 h) resulted in only the 11,919 Da 
protein, corresponding to residues 1-102. This fragment 
resulted from the thrombin cleavage at the sequence 
NNPRGHGH near the carboxyl terminus. The trun- 
cated MRP14 had reduced zinc binding compared to the 
native or recombinant full-length MRP 14. 

Zaitseva et al. [19] studied chimeric proteins, which 
are components of the "ABC transporters." They pre- 
pared a fusion protein between RbsA and RbsC with the 
factor Xa cleavage sequence lEGRGH between the two 
proteins. A poly-His leader was attached for the purpose 
of purification. Digestion with factor Xa in. lOOmM 
sodium phosphate, pH 8.0, with 20% glycerol, 0.1% 
dodecylmaltoside, 1.5 M NaCl, and 1 mM CaCl2 gave 
complete digestion of the factor Xa linker sequence but 
there was also cleavage at site in the RbsC protein at a 
VSGR (residues 6-9) sequence. The use of 50 mM Tris, 
pH 7,5, with 20% glycerol, 0.15 M NaCI, 0,01% dode- 
cylmaltoside, and 1 mM CaCh resulted in slower hy- 
drolysis, although complete digestion at the factor Xa 
linker sequence was achieved after 4h at 23 °C with 
minimal but detectable cleavage at the secondary site, A 
factor Xa cleavage site was also inserted between the 
poly-His sequence and RbsA. This site is . hydrolyzed 
more slowly than the factor Xa cleavage site between 
RbsA and RbsC. 

Muse and Bender [20] studied fusion proteins be- 
tween the nitrogen assimilation control protein (NAC) 
from either Klebsiella aerogenes (NACk) or E. coli 
(NACe), and maltose-binding protein. A factor Xa 
cleavage site lEGR was inserted between the NAC 
protein and the maltose-binding protein. The fusion 
proteins were purified by affinity chromatography on 
amylose-agarose resin. Elution was accomplished with 
1 M maltose. Digestion with factor Xa was performed in 
100 mM sodium phosphate, pH 7.0, containing 250 mM 
NaCl, 2.5 mM MgCh, and 1 mM of 2-mercaptoethanol. 
One microgram of factor Xa was added to 0.5 mg of the 
fusion proteins in lOOmL volume at 22 °C for 16 h. The 
reaction was terminated by the addition of phenyl- 
methylsulfonyl fluoride to a final concentration of 
20 mM. A fusion protein with a polyhistidine (Hise) was 
also prepared. Cleavage of NACK-maltose binding 
protein with factor Xa resulted in the formation of di- 
mers of NACk- NACE-maltose-binding protein was 



effectively cleaved by factor Xa but the resulting NACe 
was rapidly degraded (cleavage sites not provided). The 
NACE-maltose-binding protein with the Hise tail was 
resistant to cleavage by factor Xa. 

Belmouden et al. [21] prepared a fusion protein be- 
tween rat kidney long-chain L-2-hydroxy acid oxidase 
and glutathione-S'-transferase with a factor Xa cleavage 
site TEGRGIP inserted between the two proteins. Di- 
gestion with factor Xa was performed in 50 mM of Tris, 
100 mM of NaCl, and 4mM CaCh, pH 8.0, at 25 °C. 
There was little proteolysis detectable (SDS-polyacryl- 
amide gel electrophoresis) at enzyme-substrate ratios of 
1:100 or 1:10. Trypsin readily cleaved the factor Xa se- 
quence; there was also proteolysis at other sites resulting 
in three major bands. It was suggested that the factor Xa 
cleavage site between the two proteins was inaccessible 
to factor Xa. Cleavage of the linker site by factor Xa has 
also been a problem for other investigators. Ko and co- 
workers [22] prepared a fusion protein between the first 
nucleotide binding fold (NBFl) of the cystic fibrosis 
transmembrane conductance regulator (CFTR) and 
maltose binding protein with a factor Xa-sensitive linker 
site. The fusion protein resisted cleavage by factor Xa 
under a variety of solvent conditions; cleavage was ac- 
complished by first incubating the fusion protein in the 
presence of 0.08% sodium dodecyl sulfate for 16h at 
23 °C, followed by addition of factor Xa at a weight 
ratio of 1:100 and subsequent reaction at 4°C for 14 h. 
Sharma and Rose [23] observed a poor extent of cleav- 
age in a fusion protein comprised of the carboxyl-ter- 
minal nucleotide binding domain of P-glycoprotein and 
maltose binding protein. Wang et al. [24] have reported 
an extremely useful study on fusion proteins between the 
first nucleotide-binding domain and linker region of 
human multidrug resistance gene product and glutathi- 
one-5-transferase, thioredoxin, and maltose-binding 
protein. Expression of the glutathione- 5- transferase and 
thioredoxin fusion constructs as soluble proteins was 
low while much higher expression of the maltose binding 
protein construct was observed. Cleavage with factor Xa 
was unsatisfactory as there was significant non-specific 
proteolysis; insertion of a thrombin cleavage site re- 
sulted in satisfactory product. 

The above experiments are cited as case studies of 
problems associated with the use of thrombin and/or 
factor Xa to cleave fusion proteins. The point is that 
there must be careful and thorough structural charac- 
terization of the protein of interest after the cleavage 
step. With appropriate care, cleavage of fusion proteins 
with either thrombin-sensitive or factor Xa-sensitive 
sequences can be achieved. A partial compilation of 
studies using thrombin is given in Table 2 and for factor 
Xa in Table 3. 

An additional issue concerns the purity of the en- 
zymes used in the cleavage reaction. Thrombin, in par- 
ticular, can be problematic. Contamination with plasmin 
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Table 2 

Thrombin-catalyzed cleavage of fusion proteins 



Protein of interest 



Fusion 
protein 



Cleavage sequence 



Cleavage condition 



Reference 



A D P-ribosylati on 
factor 

p2-Adreno-receptor 

Transforming 
protein E7 

CRafl 
MEKl 
ERK2 

Murine osteopontin 
HTom20 



Cystic fibrosis 
transmembrane 
conductance 
regulator 
components 

Chlorocatechol 
1,2-dioxygenase 

Tenascin*C and 
Tenascin-R 



MRP14 
IL-6 

QPs3 

Heavy chain of 
tetanus toxin 

Procathepsin L 



ORF-14 
POMC 



COOH -terminal 

domain of 

apolipoprotein-E 
ACK-42 GST' 
Deoxyribonucleoside GST" 

kinase 



GST* 



His-tag^-Gs, SRGSLDPRSFLLRN 
PNDKYEPFWEDEE'^ 

GST* 



His-tag** 
GST* 
GST* 
GST* 

GST* 



GST* 

His-tag'' 
GST* 

GST* LVPRGS 
IgGl FC LVPRGS 

GST* 

MBPi 

GST* 

GST" 
GST* 

His-tag" LVPRG 



McelA and McelE 
proteins 



GST* 



50 mM Tris, pH 8.0, with 2.5 mM CaCh. 6\ig thrombin in [52] 
0.5 mL (concentration of fusion protein not given), 4h at room 
temperature on a platform rocker 

100 nM thrombin/1 mg protein per mL in 10 mM Tris-HCl, pH [53] 

7.4, at 25**C for 30min vnth constant rotation** ' 

100 units thrombin/25 mg fusion protein in 5mM [54] 

Hepes-150mM NaCI, pH 8.0, containing 10 Mg CaCb/mL 

at 23'*Cfor2hf 

Conditions not provided [55p 
Conditions not provided [55p 
Conditions not provided [55p 
0. 1 unit thrombin/ 10 \ig fusion protein at 37 " for 2 h. [56] 
Solvent conditions not provided 

The fusion protein was digested while bound to [57] 
glutathione-Sepharose 4B as a slurry in 20 mM sodium 
phosphate, 150mM NaCl, pH 7.4, with 1 mM dithiothreitol. 
Fifty micrograms of thrombin was added and digestion 
was allowed to proceed for 18h at 4*'C** 

Digestion of the column-bound by fusion protein by thrombin. [58] 
Five units thrombin/ 1 00 ^g protein in phosphate-buffered 
saline, pH 7.3, at 25 °C for 2h 



1 unit thrombin/mg fusion protein in 20 mM Tris-HCl, pH 8.0, [1 1] 
containing 150mM NaCl and 2.5 mM CaC^at 18**C for 90 h 
Thrombin was used to digest the fusion protein bound to [59] 
glutathione-Sepharose 4B matrix. Solvent conditions were not 
provided; Pefabloc SC was included to inhibit residual 
thrombin 

50 units thrombin/liter culture at 37 °C for 24 h' [1 8] 

5 units of thrombin/50 fiL phosphate-buffered saline to elute [60] 
fusion protein bound to Protein A-Sepharose 

1 Mg thrombin/500 Mg fusion protein in 50 mM Tris-HCl, pH 8.0 [6 1 ] 

containing 0.25 M sucrose. Time/temperature not provided*^ 

1 unit thrombin/75 Mg fusion protein in lOmM Tris-HCl, [62] 

pH 7.4, containing 200 mM NaCl, 1 mM EGTA, 1 mM 

dithiothreitol, and 10 mM maltose at 4*'C for 16h'^ 

The fusion protein bound to glutathione-Sepharose 4B was [63] 

digested with 0.5-2,0 units thrombin/mg fusion protein in 

20 mM Tris-Ha, pH 8.4, containing ISOmM NaCl and 

2.5 mM CaCl2 at 20 ^'C for 20 h'' 

Thrombin was used to digest the fusion protein bound to [64] 
glutathione-Sepharose 4B' 

Thrombin (425 units/liter culture) was used to digest the fusion [65] 
protein bound to Glutathione-Sepharose 4B as a slurry in 
50 mM Tris-HCl, pH 8.5, containing 150mM NaCl and 
2.5 mM CaCK at 4^ for 1 h' 

Thrombin/fusion protein, 1/100 (w/w) in 20 mM Tris-HCl, pH [66] 
7.9, containing 0.5 M imidazole and 0.5 M NaCl for 2 h at room 
temperature™ 

Conditions not provided [67] 
Thrombin was used to cleave the fusion protein bound to [68] 
glutathione-Sepharose 4B. Thrombin (400 units in four column 
volumes of 140 mM NaCl, 2.7 mM KCI, 10 mM Na2P04, and 
1.8 mM KH2PO4, pH 7.3) was recirculated through the 
column 

For Mce 1 A, 1 50 units of thrombin was used to cleave the fusion [69] 
protein in the column eluate from the glutathione-Sepharose 
4B. The digestion was performed in phosphate-buffered saline 
at 25*'C, time not provided. For the McelE fusion protein, 
digestion was accomplished with the fusion protein bound to 
the glutathione-Sepharose 4B matrix 
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Table 2 {continued) 



Protein of interest 


Fusion 
protein 


Cleavage sequence 


Cleavage condition 


Reference 


Prion domain of 


GS-P 




According to manufacturer's instructions. The amount of 


[70] 


yeast Ure2P 






thrombin was increased by fourfold to compensate for reduced 
activity in 20% glycerol 




Nucleotide-binding 


GST« 




Cleavage with 20 units thrombin applied to the 


(71] 


domains of human 






glutathione-Sepharose 4B in 20-50 mM of Tris-HCl, 




multidrug 






pH 7.5, containing 50-100 mM NaCl, 4-1 OmM dithiothreitol, 




resistance- 






and 10% glycerol" 




associated protein 










(MRPl) 










Nucleotide-binding 


MBPj 




Linker can be cleaved by either thrombin or factor Xa 


[71] 


domains of human 








multidrug 










resistance- 










associated protein 










(MRPl) 










Human ribosomal 


GST» 




20 mM Hepes, pH 8.0, 150mM NaCl, 2mM CaCb in ice, and 


[72] 


protein S3a 






lO^g thrombin for 5h 



^ GST, glutathione-5-transferase. 
*'His-tag is His(6)- 

"^This is a relatively long cleavage sequence derived from the thrombin platelet receptor sequence and the anion-binding exosite-binding domain 
[73]. The thrombin cleavage site in this sequence is at the Arg-8/Ser-9 bond. 

The fusion protein containing the P2-adrenoreceptor-Gsa protein with or without the thrombin cleavage sequence was obtained as a membrane 
preparation. 

* Thrombin was obtained from Haematologic Technologies (Essex Junction, VT). 
^Thrombin was obtained from Sigma Chemical (St. Louis, MO). 

^This study described the development of an interesting assay system for the Raf/MEK/ERK kinase cascade. 

*^The thrombin used in this study was from a crude commercial source. 

' The thrombin used in this study was obtained from CalBiochem (San Diego, CA). 

^MBP, maltose-binding protein. 

'^Thrombin was obtained from New England Biolabs (Beverly, MA). 
'Thrombin was obtained from Amersham-Pharmacia Biotech (Piscataway, NJ). 

"*The use of room temperature to describe reaction conditions should be avoided. It is assumed that room temperature is 23 °C, 

" It was not possible to isolate the nonfused version of the NPD2 protein from the GST fusion protein due to proteolytic degradation. 



has been a problem in tlie past [25,26]. More advanced 
purification technology [27] tends to make this less of a 
problem today. 

This discussion would not be complete without a 
consideration of other approaches which have been 
advanced for the cleavage of fusion proteins. Tobacco 
etch virus protease (TEV) is a highly stringent protease 
[28-32]. A vector has been recently developed for use in 
high-throughput methods for expressing proteins for 
crystallography [33]. Eisenmesser et al. [34] used TEV to 
cleave a fusion protein between human interleukin 13 
and maltose binding protein (MBP-hIL-13), Purification 
of the fusion protein was accomplished on amylose- 
agarose. After cleavage with TEV (50 mM Tris, pH 7.6, 
containing 200 mM NaCl, 1 mM EDTA, and 2mM re- 
duced glutathione), the IL-13 was insoluble and resisted 
further purification. Coexpression of MBP-hIL-13 and 
TEV resulted in high levels of hIL-13 production. Che- 
lius et al. [35] have used TEV to obtain rat insulin-like 
growth factor binding protein from a thioredoxin fusion 
protein containing a hexahistidine sequence. Mohanty et 
al. [36] have examined the effect of detergents on the 
cleavage of fusion proteins by TEV. Some but not all 



zwitterionic and neutral detergents inhibited the cleav- 
age of fusion proteins byTEV. 

Approaches that involve the use of self-cleaving 
technology have also been advanced. Hung and Chiou 
[37] studied the expression of a serine protease (Tm-5) 
from Trimeresurus mucrosquamatus (Taiwan habu) in 
E. coli. A cleavage sequence for this serine protease was 
inserted between the poly-His coding sequence and the 
5'-end of the Tm-5 clone. Renaturation of the protein 
after expression demonstrated that the protease had 
refolded correctly and cleaved the sequence between the 
His tail and the protease. 

Hall and Kunkel [38] prepared a triple domain fusion 
protein represented by MHLl-VMAl-CBD, where 
MHLl is the MutL homolog protein of interest, VMAl 
is a splicing element (or intein), and CBD is the chitin 
binding domain from Bacillus circulans. After expression 
in a yeast system {Saccharomyces cerevisiae), a soluble 
cellular extract was applied to a chitin column which 
bound the construct through the chitin binding domain 
(CBD). After washing the column to remove unwanted 
material, self-cleavage of intein from the MHL-1 protein 
was induced by the addition of dithiothreitol. 
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Table 3 

Factor Xa-catalyzed cleavage of fusion proteins 



* roieiii oi inicicbi 




ica V ugc 

sequence 




R p rRTi pp 


ADP-ribosylation 


MBP* 




20 mM Tris, pH 8.0, 1 mM EDTA, 1 mM dithiothreitol, 5mM 


[52] 


factor 






MgCl2, and 1 mM benzamidine; 5% factor Xa (weight/weight) of 
fusion protein, 24-48 h at 4°C 




Tetanus toxin heavy 


MBP 


lEGRISE 


0.5-1.0:100 (factor Xa:protein,w/w), 10 mM Tris-HCl, pH 7.4, 


[62] 


chain 






200 mM NaCl, ImM ethylene glycol-bis(P-aminoethyl ether) 
iV,yv, jV',W-tetraacetic acid, and 1 mM dithiothreitol containing 
10 mM maltose, 22 for 24 h. Reaction was terminated by 
phenylmethylsulfonyl fluoride** 




Archaeoglobus fulgidus 


MBP« 




1 ng factor Xa/100|ig fusion protein in 20 mM Tris, 100 mM NaCl, pH 


[74] 


D>lactate 






8.0, with 2mM CaClz for 5 days at 4°C= 




dehydrogenase 










Soybean profilin 


MBP» 




1:20 (w/w) factor Xa/fusion protein in lOOmM phosphate-buffered 
saline, pH 7.6, 48 h at room temperature'**'' 


[75] 


_f 


_f 


lEGR 


10 mU factor Xa/^g fusion protein in Tris-bufTered saline (10 mM Tris, 
pH 8.0, containing 150mM NaCl, and TBS), 20^0, variable times 


[76] 


D-Glutamate-adding 


MBP* 




Conditions recommended by New England Biolabs, 24-72 h at 4 °0 


[77] 


enzymes 










Single*chain fibrin- 


Hirudin 




15 ng factor Xa/150 jig fusion protein, 50 mM Tris-lOOmM NaCl-1 mM 


[78] 


specific antibody^ 






CaCl2, pH 8.0, at room temperature** '' 




Streptokinase 


His-tag* 


IQGR 


Immobilized factor Xa in 50 mM Tris-lOOmM NaCl, pH 8.0, 
containing 10 mM EDTA* 


[79] 


FKBP and SpoOF 


His-tag' 


lEGR 


1/20 (w/w) factor Xa/fusion protein in 50 mM Tris, 100 mM NaCl, and 
5mM CaCb, pH 8.0, for 4h at room temperature*'' 


[80] 


p42»P* 


Glutathione-S- 
transferase 




1 unit factor Xa/0.1 mg fusion protein in 50 mM Tris, pH 8.0, containing 
lOOmM NaCl, 1 mM CaCb, and 0.5 lubrol-PX, 16h at room 
temperature'''* 


[81] 


Plasminogen 


His-tag* 


lEGR 


Cleavage with factor Xa was unsuccessful because of a putative second 


[82] 


N-terminal peptide 






factor Xa cleavage site in the fusion protein 




Human glandular 


HK 


lEGR 


Various factor Xa/prokallikrein ratios in 25 mM Tris, 50 mM NaCl, and 


(83] 


kallikrein 


Propeptide" 




2mM CaCl2, pH 7.5, with or without 0.1% bovine serum albumin 
(BSAr 




Prohevin 


MBP 




2-5% (w/w) factor Xa in 0.02% SDS for 72 h at 22 ^C 


[84] 


E. coii enzyme I 


MBP^ 


lEGR 


c 


[85] 


N-terminal domain 










Staphylokinase 


His-tag' 


lEGR 


60 ^iL factor Xa solution in 20 mM Tris, pH 8.0, containing lOOmM 
NaCl, 2mM CaClj for 9h at 37 °C 


[86] 


Nitrogen assimilation 


MBP'* 


lEGR 


1 Jig factor Xa/0.5mg fusion protein in lOOmM NaH2P04 


[20] 



control protein 

Hepatitis B virus e 

antigens 
Procathepsin L 



Tyrosine hydroxylase 

type I 
Toxoplasma gondii 

ROP2 fragment 
Plasmodium falciparum 

acidic-basic repeat 

antigen 
Several proteins from 

E. colP 

Ribosomal protein L5 



Glutathione- 
s-transferase 
MBP» 



MBP** 



MBP^ 



His-tag* 



Various fusion 
proteins with 
his-tag' 
MBP 



Na+/glucose Glutathione- 
cotransporter(SGLTI) 5-transferase 
FKBP and SpoOP» His-tag' 



(pH 7.0) with 250 mM NaCl, 2.5 mM MgClj, and 1 mM of 

2-mercaptoethanol 

"[87] 

Factor Xa/fusion protein (1/400, w/w) in 20 mM Tris, pH 7.5, [63] 
containing 0.5 M NaCl, 10 mM maltose, and 2mM CaCl2 for 24 h 
at 25 °e 

lEGR Factor Xa/fusion protein, 1/100 (w/w) in 10 mM Tris, pH 8.5, containing [88] 

200 mM NaCl and 2mM CaClj'^ 

2% (w/w) factor Xa in 20 mM Tris, pH 7.5, containing 200 mM NaCl, [89] 
ImM EDTA, and 10 mM maltose"'* 

Factor Xa/fusion protein, 1/50 (w/w) in 10 mM Tris, pH 8.0, with [90] 
100 mM NaCl and 2mM CaCb at room temperature for 72 h*^ ** 

lEGR 1 Jig factor Xa/50 ^g fusion proteins in 20 mM Tris-HCl, 1 00 mM NaCl, [91] 

and 2 mM CaCb, pH 8.0, at 23 °C for 3 h'^ 

1 ng factor Xa/20jig fusion protein in 10 mM Tris, pH 8.0, containing [92] 
200 mM NaCl, 5% glycerol, 1 mM EDTA, and 1 mM dithiothreitol at 
4''C for 19h 

30units factor Xa in 50mM Tris, pH 8.0 containing 1% Triton X-100, [93] 
10 mM glutathione, and 1 mM CaCh at 22 ^'C for 16 h 
lEGR 1/20, w/w, factor Xa/fusion protein in 50 mM Tris-HCK pH 8.0, [94] 

containing lOOmM NaCl and 5mM CaCb at room temperature for 
4h*»''' 
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Table 3 {continued) 



Protein of interest 


Fusion partner 


Cleavage 
sequence 


Cleavage conditions 


Reference 


029 replication protein 


MBP 


lEGR 


1 factor Xa/25-lOOMg fusion protein in 20 mM Tris-HCI, pH 7.5, 


[95] 


PI 






containing 200 mM NaCl and 10 mM p-mercaptoethanol, usually at 
4 "C, time not given, cleavage monitored by SDS-PAGE 




Polyoma virus capsid 


His-tag^ 


lEGR 


The cleavage of the fusion protein is accomplished with the fusion 


[96] 


protein VPl 






protein bound to the nickel affinity matrix (nickel-nitriloacetate- 
agarose). One hundred and eighty micrograms of factor Xa is added to 
the agarose matrix suspended in 20 mM Tris-HCI, 200 mM NaCl, 5% 
(v/v) glycerol, 2mM EGTA, and 10 mM of 2-mercaptoethanol, pH 8.0^ 




Hypodermin C 


Glutathione-5- 


lEGR 


Factor Xa/fusion protein, 1/2S 


[97] 




transferase 






Clq 


MBP* 


lEGR 


1 unit factor Xa/200^g fusion protein in 20 mM Tris-HCI, pH 8.0, 
containing 0.1 M NaCl, 2mM CaCb, and 5% (v/v) glycerol for 16 h 
(overnight) at 4**C 


[98] 


Parabutoxin 3 


MBP* 


lEGR 


0.5 unit factor Xa/^g fusion protein in 20 mM Tris-HCI, lOOmM NaCl, 


[99] 








and 2 mM CaC^, pH 8.0, for 72 h at room temperature^ 


Human ^-endorphin 


Outer 


lEGR 


1.2 Jig factor Xa/mL in 20 mM Tris-HCI, pH 7.0, at 23 °C for 12 h 


[100] 




membrane 








protein F 








Human tissue factor 


MBP* 




1:100, factor Xa:fusion protein in 20 mM Tris-HCI, 200 mM NaQ, 


[101] 








1 mM EDTA, and 1 mM dithiothreitol, pH 8.0, at 25 ^'C for 16 h 



''MBP, maltose binding protein. 

Factor Xa was obtained from from Sigma Chemical (St. Louis, MO). 
•^Factor Xa was obtained from New England Biolabs (Beverly, MA). 

**The term 'room temperature' is to be avoided in publications as such term is not definitive. At the present time, room temperature is considered 
to be in the range of 15-30 "C; 23**C is a currently accepted value for the purpose of compliance with Good Laboratory Practice (GLP) requirements. 
•^Factor Xa was obtained from MobiTech (Gottingen, Germany). 

*^This work was focused on the development of some novel recombinant factor Xa preparations for use in the cleavage of fusion proteins. 
Two fusion proteins were used in this study: a biologically inactive a-factor-hirudin fusion protein and a maltose binding protein-calmodulin. 

^This work describes the construction of a novel biotherapeutic which would release hirudin at the site of fibrin clot formation in vivo via 
cleavage of the fusion protein by factor Xa. It is suggested that this would provide *'an effective localized antithrombin activity. . . without systemic 
bleeding complications." 

Factor Xa was obtained from Boehringer-Mannheim (Germany). 

*His-tag is His(6). 

-'Immobilized factor Xa was obtained from Protein Engineering Technology (Aarhus, Denmark). 
''Factor Xa was obtained from Promega (Madison, WI). 

'Factor Xa was obtained from Amersham-Pharmacia Biotech (Piscataway, NJ). 
™ Recombinant factor Xa was obtained from Professor J. Stenflo (Malmo, Sweden). 

" In this study, a factor Xa (or enterokinase) cleavage site is introduced into the propeptide region of the human glandular kallikrein precursor, 
activation of the recombinant protein during purification is avoided. 

"Secondary cleavages in the fusion protein were observed in this study. 

^This paper describes a technique study evaluating various fusion proteins with three E. coH proteins, NusA, GrpE, and bacterioferritin (BFR) as 
well as thioredoxin as fusion partners. 

''This is a technical paper describing heterogeneity in fusion proteins reflecting partial removal of the initiating methionine residue. 



Enterokinase has also been used to cleave fusion 
proteins. Examples include the fusion protein between 
Giardia duodenalis p-tubulin and a polyhistidine peptide 
[39]. Providencia stuartii aminoglycoside 2'-A^-acetyl- 
transferase and hexahistidine [40], rat mast cell protease 
4 and hexahistidine [41], and human secretory leucocyte 
protease inhibitor and hexahistidine [42], There are 
several more general references on the use of enteroki- 
nase [43,44] including a report on the expression of the 
enterokinase catalytic subunit in E. coli [44]. Several of 
the reports on the use of enterokinase for the cleavage of 
fusion proteins describe the use of green fluorescent 
protein inserted between the hexahistidine and the en- 
terokinase cleavage site [44-46]. This enables the facile 
visualization of the fusion protein during purification. 



Finally, there are situations where cyanogen bromide 
(CNBr) can be used for cleavage of the linker peptide 
[47-51]. The protein/pep tide must be stable to the rather 
harsh solvent conditions used for CNBr cleavage. 



Comment 

The expression of recombinant proteins as fusion 
proteins where the fusion partner facilitates purification 
is a very useful technology. However, it is clear that 
rigorous characterization of the protein of interest fol- 
lowing cleavage is essential and this is rarely done. Re- 
tention of biological activity does not necessarily mean 
that unwanted proteolysis has not occurred during the 
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cleavage step. Other issues, which need to be considered, 
are the choice of the solvent for the cleavage reaction 
with careful consideration of both pH and ionic 
strength. Each of the parameters can affect the rate and 
specificity of hydrolysis. In general, reactions involving 
thrombin and factor Xa are performed with near phys- 
iological conditions, with the pH in the range of 7.0-8.0 
and ionic strength equivalent to that of 0. 1-0.1 5 M 
NaCl. Due to its metal ion-dependent conformation, 
CaCl2 should be included in factor Xa mediated cleav- 
age reactions. These reaction conditions along with the 
ratio of enzyme to substrate, the temperature at which 
the reaction is performed as well as the duration of the 
reaction are parameters that will affect the rate and 
specificity of cleavage as well as the stability of the target 
protein. The fact that reaction conditions may require 
tailoring for each specific construct cannot be overem- 
phasized. Finally, the purity of the enzyme used for 
cleavage is critical and, as such, the source of cleavage 
enzyme is critical. 
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fusion protein of the epitope with a protein to be iso- 
lated, and isolate the protein using HPC-4-based affmity 
chromatography. In the preferred embodiment, a spe- 
cific protease cleavage site is inserted between the epi- 
tope and the protein so that the epitope can be easily 
removed from the isolated protein. In an example, a 
functionally active soluble tissue factor including the 
twelve amino acid epitope recognized in combination 
with calcium by HPC-4 and a factor Xa cleavage site 
was expressed from a vector inserted into a procaryotic 
expression system. The recombinant tissue factor can be 
rapidly isolated in a single chromatographic step using 
the HPC-4 monoclonal antibody immobilized on a suit- 
able substrate. Once isolated, the Protein C epitope is 
removed by cleavage with factor Xa, leaving the func- 
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EXPRESSION AND PURIFICATION OF 
RECOMBINANT SOLUBLE TISSUE FACTOR 

United States government has rights in this invention 5 
by virtue of grants from the National Institutes of 
Health, grant numbers ROl HL4422S and ROl 
HL29807. 

This is a continuation-in-part of U.S. Set. No. 
07/730,040 filed Jul. 12, 1991, now U.S. Pat. No. 10 
5,202,253, which is a continuation of U.S. Ser. No. 
07/292,447, abandoned, entitled "Monoclonal Anti- 
body against Protein C" filed Dec. 30, 1988 by Charles 
T. Esmon and Naomi L. Esmon, and a continuation-in- 
part of pending U.S. Ser. No. 07/683,682 entitled 15 
"Quantitative Clotting Assay for Activated Factor 
VII" filed Apr. 10, 1991 by James H. Mornssey. 

This invention is generally in the area of methods for 
purifying proteins, especially blood clotting proteins, 
using recombinant technology and an unique epitope of 20 
a monoclonal antibody directed against Protein C zy- 
mogen. 

Methods for purifying proteins have been used for 
many years and can be generally divided into chromato- 
graphic methods, for example, ion exchange chroma- 25 
tography, molecular weight sieving, high pressure liq- 
uid chromatography, affinity chromatography, and 
electrophoretic methods, for example, electrophoresis 
on agarose or acrylamide gels and isoelectric focusing. 
The usual disadvantages of all of these methods are that 30 
they require the starting material be passed through 
several processes to remove contaminants to the point 
where the desired material is substantially pure. 

In inununoaffinity chromatography, an antibody to 
the desired protein or other molecule is immobilized on 35 
a chromatographic substrate, the protein mixture is 
applied to the substrate under conditions allowing the 
antibody to bind the protein, the unbound material is 
removed by washing, and the bound protein is eluted 
using, for example, high or low pH, protein denaturants 40 
or chaotropes. The end result is a substantially pure 
protein which often lacks full biological activity. 

A variation of this method is described in U.S. Ser. 
No. 07/730,040 filed Jul. 12, 1991, which is a continua- 
tion of U.S. Ser. No. 07/292,447 entitled "Monoclonal 45 
Antibody against Protein C" filed Dec. 30, 1988 by 
Charles T. Esmon and Naomi L. Esmon, disclosing the 
properties of the monoclonal antibody, HPC-4. The 
hybridoma cell line which secretes the monoclonal 
antibody designated as HPC-4, was deposited with the 50 
American Type Culture Collection, 12301 Parklawn 
Drive, Rockville, Md, on Nov. 2, 1988, and assigned 
ATCC No. HB 9892. This deposit is available to the 
public upon the grant of a patent. 

HPC-4 binds protein C, not activated protein C 55 
(APC), and only in the presence of calcium. Thus, when 
the antibody is immobilized on an affinity support, pro- 
tein C can be isolated from either plasma-derived 
sources or from tissue culture expression systems under 
extremely mild conditions. This is important in main- 60 
taining the biological activity of the product and the 
stability of the solid support resin. Because activated 
protein C is not bound under any conditions, the result- 
ing product is completely free of APC. 

The antibody binds to a defined region of the protein 65 
C molecule that is contained within residues 6 and 17 of 
the heavy chain, specifically EDQVDPRLIDG 
K. This peptide can be immobilized directly on a solid 
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support resin and can be used to isolate the antibody in 
high concentrations from mouse ascites fluid or tissue 
culture supematants. This approach allows the isolation 
of the antibody in extremely pure form in high yield, 
even from very dilute solutions. The antibody can be 
removed from the solid support peptide either by the 
removal of calcium ions, if desired, or by 1.5 M guani- 
dine, which does not affect the function of the purified 
monoclonal antibody. 

It would be advantageous if the purification methods 
using the Protein C epitope in combination with HPC-4 
could be applied to the purification of other proteins, 
especially blood clotting proteins One such system 
using a completely different antibody has been de- 
scribed by Prickett, et al., BioTechniques 7(6), 580-589 
(1989), using a calcium-dependent antibody tiiat recog- 
nizes several of the enterokinase sites that are used for 
Ca^+coordination, including residues 1, 2, 3, 4, and 7 
from FIG. 4, p. 583 and Table I, p. 586. This sequence 
is attached to the N-terminus of the protein to be iso- 
lated, and subsequently removed by treatment with 
urokinase. 

It is therefore an object of the present invention to 
provide a method and means allowing isolation of puri- 
fied proteins, especially blood clotting proteins, in a 
single chromatographic step. 

It is a further object to provide recombinant proteins 
having an amino acid sequence specifically bound by a 
monoclonal antibody. 

SUMMARY OF THE INVENTION 

A method is disclosed to make any protein in a form 
that can be isolated rapidly from a solution using a 
specific monoclonal antibody designated **HPC-4**. 
HPC-4 binds a twelve amino acid epitope of Protein C 
zymogen recognized in combination with calcium. It 
has now been determined that it is possible to form a 
fusion protein of the epitope with a protein to be iso- 
lated, and isolate the protein using HPC-4-based affmity 
chromatography. In the preferred embodiment, a spe- 
cific protease cleavage site is inserted between the epi- 
tope and the protein so that the epitope can be easily 
removed from the isolated protein. In the most pre- 
ferred embodiment, the fusion protein is formed by 
expression of a recombinant gene inserted into an ap- 
propriate vector. 

In an example, a functionally active soluble tissue 
factor including the twelve amino acid epitope of Pro- 
tein C zymogen recognized in combination with cal- 
cium by a specific antibody to Protein C zymogen 
("HPC-4") and a factor Xa cleavage site was expressed 
from a vector inserted into a suitable procaryotic or 
eukaryotic expression system. The recombinant soluble 
tissue factor can be rapidly isolated in a single chro- 
matographic step using the HPC-4 monoclonal anti- 
body immobilized on a suitable substrate. Once isolated, 
the Protein C epitope is removed by cleavage with 
factor Xa, leaving the functionally active, soluble tissue 
factor. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is expression vector pIN-III-pelB encoding 
the HPC-4 epitope and factor Xa cleavage site, fol- 
lowed by several restriction enzyme sites for insertion 
of the gene encoding the protein to be isolated. 

FIG. 2 is a schematic representation of steps involved 
in the construction of pIN-III-pelB-tTF expression vec- 
tor. Subcloning of tTF was performed in two stages. At 
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the first stage an intermediary plasmid was prepared 
which harbored the tTF gene (pUC-tTF). At the sec- 
ond stage the tTF gene was removed from pUC-tTF 
and subcloned into the plN-III-pelB (see the text for 
more detail). The resulting pIN-III-pelB-tTF was used 5 
for the transformation of £1 coU (XLl-B) for expression. 

FIG. 3 is a photograph of an SDS gel electrophoresis 
of the tissue factor, which runs as a monomer in the gel 
even without disulfide bond reduction. 

FIG. 4 is a graph of Xa generation rate (m absorban- 10 
cc/min/ min) v ersus tTF (nM) for TTF -HPC -4 (open 
circles); TTF-293 (closed circles); and TTF-Xa-DIG 
(open squares). 

DETAILED DESCRIPTION OF THE , , 

INVENTION 

A fusion protein readily isolated by aflinity chroma- 
tography using HPC-4 antibody is prepared by insertion 
of a DNA sequence encoding the twelve amino acid 
HPC-4 epitope into a vector, followed by the gene 20 
encoding the protein to be isolated. In the preferred 
embodiment, a specific protease cleavage site is inserted 
into the vector between the epitope and protein coding 
sequence, so that the resulting fusion protein can be 
easily cleaved to yield the epitope peptide and the de- 25 
sired protein. 

In the following non-limiting example, a nucleic acid 
sequence encoding the twelve amino acid residue epi- 
tope for HPC-4 is inserted into an expression vector in 
an orientation such that the expression protein contains 30 
the HPC-4 epitope followed by a factor Xa cleavage 
site and then the amino terminus of tissue factor which 
has had the signal peptide cytosolictail and transmem- 
brane spanning domains deleted. 

The HPC-4 Monoclonal Antibody 35 

The HPC-4 antibody and uses thereof are described 
in U.S. Ser. No. 07/730,040 filed Jul. 12, 1991, which is 
a continuation of U.S. Ser. No. 07/292,447 entitled 
"Monoclonal Antibody against Protein C" filed Dec. 
30, 1988 by Charles T. Esmon and Naomi L. Esmon, the 40 
teachings of which are incorporated herein. A detailed 
analysis of the properties of the HPC-4 monoclonal is 
presented in Steams, et al., •'The Interaction of a Ca2+- 
Dependent Monoclonal Antibody with the Protein C 
Activation Peptide Region,** J. Biol. Chem. 263, 45 
826-832 (1988). 

The HPC-4 monoclonal antibody is directed against a 
peptide sequence present in the activation region of the 
heavy chain of Protein C and Ca2+. This peptide se- 
quence consists of twelve amino acids, glutamic acid- 50 
aspartic acid-glutamine-valine-aspartic acid-proline- 
arginine-leucine-isoleucine-aspartic acid-glycine-lysine 
(EDQVDPRLIDGK) (SEQUENCE 1). An 
advantage of this sequence is that it is short enough to 
be made synthetically but long enough to impart speci- 55 
ficity, thereby avoiding potential cross reactions be- 
tween the antibody and proteins other than the fusion 
protein. 

The antibody appears to have at least one metal ion 
binding site in addition to the peptide binding site. The 60 
peptide binding activity is responsive to, or "dependent 
on", binding at the metal ion binding site. The metal ion 
binding site is capable of binding to a divalent metal 
cation such as calcium, or a metal having a similar ionic 
radius and. coordination properties such as Tb^'*'. The 65 
peptide does not bind Ca^+and hence no Ca2+ binding 
site is added to the fusion protein. This will minimize 
nonspecific Ca^+mediated interaction with the isolation 
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matrix potentially inherent in the Prickett approach, 
described at page 3. In the case of tissue factor, it also 
means that calcium binding to factor VII (the ligand for 
tissue factor) can be studied without interference due to 
an additional metal binding site in the tissue factor fu- 
sion protein. 

When calcium binds to the metal ion binding site in 
the antibody, the monoclonal antibody becomes signifi- 
cantly more receptive to binding to the peptide. When 
a metal ion is not bound to the metal ion binding site of 
the monoclonal antibody, the antigen binding site is 
relatively unreceptive to binding the antigen. Accord- 
ingly, antibody-antigen binding may be controlled by 
varying the metal ion concentration in the media sur- 
rounding the antibody. 

Proteins to be Expressed and Purified. 
The method described herein is not limited as to the 
protein that can be expressed as a fusion protein, iso- 
lated using HPC-4, then separated from the epitope as 
the pure protein. 

Vectors and Expression Systems 
A vector is selected for expression of a sequence 
encoding the pelB leader peptide HPC-4 epitope and 
protein to be isolated, preferably separated by a specific 
protease cleavage site. Examples of suitable bacterial 
expression vectors which are commercially available 
include pcDNA II (Invitrogen #V400.20), pNHSa 
(Stratagene, #215201) and pBTacl (Boehringer Mann- 
heim, #1081365). These vectors are used for expression 
of full length or partial cDNA sequences in bacteria 
such as E. coll, where the expressed protein usually 
accimiulates as insoluble aggregates in the cytoplasm of 
bacteria called inclusion bodies. To extract the target 
protein, a high concentration (usually 8 M) of a chao- 
tropic agent such as urea is required to dissolve inclu- 
sion bodies. This process denatures the protein and 
results in the inactivation of the target protein. To ob- 
tain a functional protein, a refolding step is necessary 
which is usually very inefficient and difficult to control. 
Examples of commercially available mammalian ex- 
pression vectors include pRc/RSV (Invitrogen, 
#V780-20), pRc/CMV Onvitrogen #V750-20), and 
pMClNeO (Stratagene #213201). These expression 
vectors usually contain a suitable promoter that can 
direct high-level expression of recombinant proteins in 
mammalian cells and they also contain a drug resistant 
gene that can be used for selection of those mammalian 
cells that have integrated these vectors into their ge- 
nomes. These vectors are suitable for the expression of 
full length cDNA and any other DNA fragment which 
contains a leader peptide at the 5' end of the sequence. 
These constructs are transferred into a suitable expres- 
sion system, either procaryotic cells such as R coli, or 
eukaryotic cell, such as a yeast or mammalian cell cul- 
ture system. 

It is also possible to insert the cDNA encoding the 
fusion protein into an embryo for production of a trans- 
genic animal for production of the protein using known 
methodology. The protein expression can be targeted to 
a specific tissue using a tissue specific promotor in com- 
bination with the protein encoding sequence. For exam- 
ple, the fusion gene is isolated on 1% agarose gel fol- 
lowed by electroelution in a dialysis bag, as described 
by Maniatis, et al. (1982). The eluted DNA is precipi- 
tated, redissolved in water and purified by passing 
through an elutip-D column as per the instructions of 
the manufacturer (Schleicher and Schuell, Inc., Keene, 
NH). The purified DNA is dissolved in 5 mM Tris (pH 
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7.4) and 0.1 mM EDTA at 3 fig/ml concentration for 
nucroinjection. 

Mice or other suitable animals such as rabbits or 
sheep embryos arc obtained from commercial suppliers. 
Reagents such as bovine serum albumin, gelatin, and 5 
pronase are obtained from Sigma Chemical Co , St. 
Louis, MO. Hormones for superovulation, PMS and 
hCG, are obtained from Organon, Inc., NJ. Hyaluronic 
dase is purchased from Sigma. Restriction enzymes are 
obtained from New England Biolabs, Beverly, MA. 10 
The micromanipulator made by Nara Shige, USA, Inc.» 
Rainin Instruments Co., Wobum, MA can be used to 
microinject DNA into the pronuclei. DMEM, fetal 
bovine serum, and DPBS can be obtained from GIBCO 
Laboratories, Gaithersville, MD. 15 

For construction of transgenic , mice, procedures for 
embryo manipulation and microinjection are described 
in "Manipulating the Mouse Embryo" by B. Hogan, F. 
Costantini and £. Lacy (Cold Spring Harbor Labora- 
tory, 1986). Similar methods are used for production of 20 
other transgenic animals. Mouse zygotes are collected 
from six week old females that have been superovulated 
with pregnant mares serum (PMS) followed 48 hours 
later with human chorionic gonadotropin. Primed fe- 
males are placed with males and checked for vaginal 25 
plugs on the following morning. Pseudopregnant fe- 
males are selected for estrus, placed with proven sterile 
vasectomized males and used as recipients. Zygotes are 
collected and cumulus cells removed by treatment with 
hyaluronidase (1 mg/ml). Pronuclear embryos are re- 30 
covered from female mice mated to males. Females are 
treated with pregnant mare serum, PMS, (5 lU) to in- 
duce follicular growth and human chorionic gonadotro- 
pin, hCG (51 U) to induce ovulation. Embryos are re- 
covered in a Dulbecco's modified phosphate buffered 35 
saline (DPBS) and maintained in Dulbecco's modified 
essential medium (DMEM) supplemented with 10% 
fetal bovine serum. 

Microinjections can be performed using Narishige 
micromanipulators attached to a Nikon diaphot micro- 40 
scope. Embryos are held in 100 microliter drops of 
DPBS under oil while being liiicroinjected. DNA solu- 
tion is microinjected into the largest visible male pronu- 
cleus. Successful injection is monitored by swelling of 
the pronucleus. Immediately after injection embryos are 45 
transferred to recipient females, mature mice mated to 
vasectomized male mice. Recipient females are anesthe- 
tized using 2,2,2-tribromoethanol. Paralumbar incisions 
are made to expose the oviducts and the embryos are 
transformed into the ampullary region of the oviducts. 50 
The body wall is sutured and the skin closed with 
wound clips. Recipients are appropriately ear notched 
for identification and maintained until parturition. 

At three weeks of age about 2-3 cm long tail samples 
are excised for DNA analysis. The tail samples are 55 
digested by incubating overnight at 55* C. in the pres- 
ence of 0.7 ml 50 mM Tris, pH 8.0, 100 mM EDTA, 
0.5% SDS and 350 yt,g of proteinase K. The digested 
material is extracted once with equal volume of phenol 
and once with equal volume of phenol :chloroform (1:1 60 
mixture). The supematants are mixed with 70 fil 3 M 
sodium acetate (pH 6.0) and the DNAs are precipitated 
by adding equal volume of 100% ethanol. The DNAs 
are spun down in a microfuge, washed once with 70% 
ethanol, dried and dissolved in 100 fiL TE buffer (10 65 
mM Tris, pH 8.0 and 1 mM EDTA). 10 to 20 fil of 
DNAs were cut with BamHI and Bglll or EcoRI, 
electrophoresed on 1% agarose gels, blotted onto nitro- 



cellulose paper and hybridized with 32p.iabeled DNA 
sequences. Transgenic animals are identified by autora- 
diography. 

The transgenic females are mated. At five days fol- 
lowing parturition milk samples were taken and assayed 
for the fusion protein. At six to seven weeks of age 
transgenic males are mated. The Fl litters are analyzed 
for transgene.'The positive females are kept and mated 
at five weeks of age. At five days following parturition 
milk samples are assayed for the fusion protein. Milk 
samples (30-200 pA) are collected from anesthetized 
mice injected with 0.05 units of oxytocin, an inducer of 
lactation. The milk is collected in a glass capillary with 
the aid of mammary palpation. The fusion protein is 
then isolated by binding to the HPC-4 antibody. 

Purification using the HPC-4 antibody-epitope. 

The antibody can be bound to a variety of substrates, 
for use in purification and isolation of the fusion protein, 
including agarose, acrylamide and other types of con- 
ventional chromatographic resins, filters, etc. These 
materials are known to those skilled in the art, as are the 
methods for attaching the protein to them. The selec- 
tion of the material will depend in large part on the scale 
of the purification or the sample to be analyzed, as well 
as biocompatibility and government agency approval 
where the end-product is for pharmaceutical use. 

Protease Cleavage Site. 

In the most preferred embodiment, the fusion protein 
includes a protease cleavage site between the epitope 
and the protein to be isolated. Suitable sites include 
sequences cleaved by Factor Xa: He Glu Gly Arg 
(lEGR), enterokinase: Asp Asp Asp Asp Lys 
(DDDDK), and thrombin: Phe/Gly Pro Arg (F/GPR). 

Following purification with the HPC-4, the fusion 
protein is treated with the appropriate enzyme to cleave 
the binding peptide from the desired protein. 

The present invention will be further understood by 
reference to the following non-limiting examples. 

EXAMPLE 1 

Construction of a vector for expression of a fusion 
truncated tissue factor. 

Blood coagulation results from the production of 
thrombin, a proteolytic enzyme inducing platelet aggre- 
gation and cleaving fibrinogen to fibrin, which stabilizes 
the platelet plug. A number of proenzymes and proco- 
factors circulating in the blood interact in this process 
through several stages during which they are sequen- 
tially or simultaneously . converted to the activated 
form, ultimately resulting in the activation of prothrom- 
bin to thrombin by activated factor X (fXa) in the pres- 
ence of factor Va, ionic calcium, and platelets. 

Factor X can be activated by either of two pathways, 
termed the extrinsic and intrinsic pathways. The intrin- 
sic pathway, or surface-mediated activation pathway, 
consists of a series of reactions where a protein precur- 
sor is cleaved to form an active protease, beginning 
with activation of factor XII to factor Xlla, which 
converts factor XI to factor XIa, which, in the presence 
of calcium, converts factor IX to factor IXa. Factor IX 
can also be activated via the extrinsic pathway by tissue 
factor (TF) in combination with activated factor VII 
(factor Vila; fVIIa). The activated factor IX, in the 
presence of calcium, phospholipid (platelets), and factor 
Villa, activates factor X to factor Xa. 

Physiologically, the major pathway involved in coag- 
ulation is believed to be the extrinsic pathway, an essen- 
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tial step of which is activation of factor VII to factor 
Vila. Clotting assays and other activity assays designed 
to measure factor VII and Vila generally must employ 
TF, the cofactor required for factor Vila coagulant 
activity. Most commonly, TF is provided as a relatively 5 
crude preparation known as thromboplastin. Tissue 
factor is an integral membrane glycoprotein having a 
protein and a phospholipid component. It has been 
isolated from a variety of tissues and species and re» 
ported to have a molecular mass of between 42,000 and 10 
53,000. DNA encoding tissue factor and methods for 
expression of the protein have now been reported, for 
example, in European Patent Application 0 278 776 by 
Genentech, Inc. and by J. H. Morrissey, et al. Cell 50, 
129-135 (1987). 15 

The nucleotide (SEQUENCE 2) and amino acid 
(SEQUENCE 3) sequence of truncated tissue factor 
(tTF) is shown below, which, as described below, was 
modified from the sequence described in U.S. Ser. No. 
07/683,682 filed Apr 10, 1991, the teachings of which 20 
are incorporated herein. The truncated tissue factor 
protein lacks the predicted transmembrane and cyto- 
plasmic domains of tissue factor. The essential differ- 
ence between truncated tissue factor and wild-type 
tissue factor is that truncated tissue factor is no longer 25 
tethered to the phospholipid membrane surface. Soluble 
tissue factor is a cofactor for activated factor VII 
(FVII) but not precursor factor VII (FVII). Intact tis- 
sue factor is a cofactor for FVII and FVIIa. The Smal 
site at the 5! end, and the Xbal site at the ^' end are 30 
underlined. The first three nucleotides of the Smal site 
are removed upon digestion with the restriction en- 
zyme, Smal. This permits the tTF cDNA sequence to 
be blunt-end ligated to StuI site of the pIN-III-pelB- 
HPC-4 expression vector in a manner that preserves the 35 
reading frame of the tTF cDNA clone. 
Construction of pIN-III-pelB Expression Vector: 
This vector is derived from pIN-III-ompA (John 
Ghrayeb et al, EMBO J. 3(10):2437-2442 (1984)). 
Cleavage of pIN-III-ompA with Xbal and BamHI re- 40 
moves a DNA fragment which contains the Shine-Dai- 
gamo sequence GAGG, and the entire nucleotide se- 
quences encoding for the ompA signal peptide. As 
shown in FIG. 1, by ligation of 8 overlapping oligonu- 



which encodes two extra residues Glu, Phe (+1 and 2). 
The DNA fragment also encodes a 12 amino acid resi- 
due long peptide (residue 3 to 14), which is the epitope 
for the Ca++dependent monoclonal antibody HPC-4. 
The EcoRI restriction site separates the HPC-4 epitope 
from the pelB signal peptide. Following the nucleotides 
encoding the epitope, are the nucleotides encoding the 
four amino acid residues He, Glu, Gly, and Arg, form- 
ing a factor Xa cleavage site (SEQUENCE 6). 

The factor Xa cleavage site is followed by a sequence 
containing several restriction enzyme sites for cloning 
of the target gene into this vector for expression. As 
shown in FIG. 1, this DNA fragment contains an Xbal 
sticky end at the 5' end and a BamHI sticky end at the 
3' end for ligation to the Xbal and BamHI sites of pIN- 
III-ompA. Multiple cloning sites of pIN-III-pelB in- 
cludes StuI, Not I, Hindlll, and BamHI. StuI is a 6 bp 
blunt end cutter whose recognition site is AGG CCT; 
when it cleaves this site it leaves three nucleotides at the 
3' end of the DNA fragment, AGG, which encode the 
last residue in the FXa cleavage site, Arg. Therefore, 
double digestion of this vector with StuI and either one 
of the other 3' end cloning sites (NotI or Hindlll or 
BamHI) provides a suitable directional cloning of the 
target gene for expression. 

The oligonucleotide sequence of a synthetic DNA 
fragment encoding the pelB leader peptide (AA-22 to 
-1) (SEQUENCE 4); HPC-4 epitope (AA +3 to 14) 
(SEQUENCE 1); and the FXa cleavage site (AA 15 to 
18) (SEQUENCE 6), are shown below. The Shine-Dal- 
gamo sequence GAGG (S-D) is overlined. The restric- 
tion enzyme sites useful for cloning have been outlined, 
which are EcoRI (between the pelB leader sequence 
and the HPC-4 epitope which creates two extra amino 
adds +1 and -f 2), StuI, NotI, and Hindlll. The pelB 
leader peptide cleavage site which is cleaved by bacte- 
rial signal peptidase is shown by an arrow. This DNA 
fragment was made by ligation of eight overlapping 
oligonucleotides so that it created a sticky Xbal site at 
the 5' end and a sticky BamHI site at the 3' end for 
ligation into the Xbal and BamHI site of the pIN-III- 
ompA vector. The boundary of oligonucleotides for the 
sense strand and at the bottom for the complementary 
antisense strand is shown by bold letters. 



-22 

Xbal end M 

S'-CTAGATAACGAGGGCAAAAAATGAAATACCTATTGCCTACGGCAGCCGC 
3'-TATTGCTCCCOTTTTTTACTTTATGGATAACGGATGCCGTCGGCG 

-1 i+1 

TGGATTGTTATTACTCGCTGCCCAACCAGCCATGGCCGAATTCGAAGAT 
ACCTAACAATAATGAGCGACGGGTTGGTCGGTACCGGCTTAAGCTTCTA 

EcoRI 

CAGGTAGATCCGCGGTTAATCGATGGTAAGATTGAAGGAAGGCCTAGGC 
GTCCATCTAGGCGCCAATTAGCTACCATTCTAACTTCC TTCCGGATCCG 

StuI 

GGCCGCAAGCTTG-3' 
CCGGCGTTCGAAC CTAG-3' BamHI end 
NotI Hindlll 



cleotides (four sense and the other four complementary 
antisense), a DNA fragment was synthesized which 
contains the missing Shine-Dalgamo sequence followed 
by oligonucleotides sequence encoding a 22 residue 
long peptide representing the pelB signal peptide (Sahu- 
Ping Lei, et al, J. Bacteriol. 169(9):4379-4383, 1987) 
(AA -22 to -1) (SEQUENCE 4). 

An EcoRI restriction site was included in this DNA 
fragment, immediately after the pelB leader sequence. 



Construction of pIN-III-pelB-tTF 

The construction of pIN-III-pelB-tTF is shown sche- 
matically in FIG. 2. pJH27 is a plasmid that contains the 
entire truncated tissue factor (tTF) cDNA sequence 
cloned between BamHI and Xbal restriction sites of the 
vector pGEM-7Zf(-{-) (Promega). Preparation of the 
tTF cDNA fragment with 5' and 3' ends compatible 
with cloning sites in the pIN-III-pelB expression vector 
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was performed in a two-step process. At the first suge 
the pJasmid pJH27 was digested with Bbvl restriction 
enzyme which cleaved away DNA sequences from the 
S' end of the cDNA coding for the tTF signal peptide 
and the first 11 residues of the mature tTF protein. In S 
order to repair the tTF gene two complementary oligo- 
nucleotides were synthesized, which coded for the 1 1 
missing residues and created a blunt, half-Smal site at 
the 5' end. This oligonucleotide contained a sticky end 
that could hybridize to, and therefore be ligated to, the 10 
sticky end of the tTF cDNA sequence created by Bbvl 
digestion. 

In the second stage, the modified cDNA insert was 
removed from pJH27 by digestion with Xbal, and the 
insert was resolved by agarose gel electrophoresis and 15 
purified by elution from the agarose gel. The resulting 
cDNA fragment which encodes the entire tTF was 
subcloned into the Smal and Xbal sites of pUC19 plas- 
mid. As shown in FIG. 2, the tTF cDNA fragment then 
was removed from pUC19 with Smal and Hindlll re- 20 
striction enzymes and subcloned into the StuI and Hin- 
dlll sites of the pIN-III-pelB expression vector. 

EXAMPLE 2 

Expression and Isolation of the Fusion Protein 25 

Growth of Bacteria for Pcriplasmic Extract The 
XLl-B strain of E. coli was grown at 37" C. in LB me- 
dia. 500 >il of overnight culture was transferred to a 125 
ml flask containing 25 ml of LB and 100 fig/ml ampicil- 
lin. The flask was shaken at IT C. for 2-3 hrs until the 
OD600 is equal to 0.6 to 0.7. One liter LB media with 
ampicillin (Amp) was inoculated with the entire 25 ml 
bacterial culture and the shaking was continued at 37° 
C. until the OD600=0.6 to 0.7. The culture then was 
induced with 1 mM isopropyl-B-D-thiogalactopyrano- 
side (IPTG) and the shaking was continued for another 
8 hrs at room temperature. The bacterial culture was 
centrifuged at approximately 3000 rpm for 20 min to 
separate the culture medium from the cells. The cell 
pellet was resuspended with 50 ml of cold water and ^ 
incubated for i hour on ice with shaking. The periplas- 
mic extract was then collected by spinning at 10,000 g 
for i hr. 

Approximately three-fourths of the protein was ex- 
pressed in the media of the JET. coli, and one-fourth was 
recovered from the periplasmic space after hypotonic 
shock. The periplasmic extract was mixed with the 
culture medium (which has been clarified by centrifuga- 
tion, as described above), and the mixture was concen- 
trated to approximately one- tenth the original volume ^ 
using an ultrafiltration spiral cartridge concentrator 
with a 3000 MW cut-off membrane (Amicon). Follow- 
ing concentration, this material was brought up to 0. 1 M 
NaCl, 0.02 M Tris and 1 mM calcium chloride, and 
loaded on a HPC-4 column equilibrated with the same 
buffer (5 mg/ml HPC-4 IgG immobilized on Affi- 
Gel TM 10; 4 ml total used for 1 liter of starting bacterial 
culture), washed with approximately 200 ml 1 M NaCl, 
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0.02 M Tris-HCl, pH 7.5 conuining 1 mM CaCh, fol- 
lowed by approximately 10 to 20 ml of the same bufTer 
but with 0.1 M NaCl, and the protein eluted with 0.1 M 
NaCl, 0.02 M Tris-HCl, 5 mM EDTA, pH 7,5. Elution 
of the tTF from the column was monitored by absor- 
bance of light at 280 nm; a single peak of protein was 
observed. 
SDS-PAGE 

SDS-PAGE analysis (10% acrylamide using La- 
emmli system, Laemmli, Nature 227:680-685 (1970)) of 
the peak fraction from the affinity chromatography 
indicated a single monomeric band at around 30 kd in 
both reducing and non-reducing conditions, as shown in 
FIG. 3. This is consistent with the expected molecular 
weight of tTF when it is not glycosylated. 

Approximately 1 mg of soluble tissue factor is recov- 
ered from 1 liter of starting bacterial culture, and is 
homogeneous as shown by SDS gel electrophoresis. 
The tissue factor runs as monomer in the gel even with- 
out disulfide bond reduction. The soluble, truncated 
tissue factor protein isolated in this manner has full 
cofactor activity toward factor Vila even without the 
HPC-4 epitope removed; this activity is equivalent to 
that of truncated tissue factor without the HPC-4 epi- 
tope expressed in mammalian cells, as shown by FIG. 4. 

Functionally the protein is equivalent to the protein 
expressed in mammalian cell culture, but at less than 
10% the cost. This functional equivalence is demon- 
strated by the factor Vila concentration dependence on 
factor X activation and by the ability of the tissue factor 
from both sources to increase factor Vila amidolytic 
activity equivalently. 

EXAMPLE 3 

Isolation of other proteins using the fusion protein 
technology in combination with the HPC-4 epitope. 

Thrombomodulin fTdi^mts^ EGF domains of factor 
X and EGF domains of protein C with the epitope 
linked to the amino terminal region of these proteins 
have been expressed in and recovered from other E. coli 
periplasmic space expression systems. A fragment of 
thrombomedulin of the 4th-14 6th EGF domains has 
also been isolated from cultured mammalian cells in a 
single step purification on the antibody column as de- 
scribed above. In this case, the DNA encoding the 
thrombomedulin 4th-6th EGF domains was amplified 
from cDNA by PCR and ligated to synthetic oligonu- 
cleotides coding for the transferrin signal peptide fol- 
lowed by the HPC-4 epitope and Xa cleavage site. The 
resulting DNA fragment was subcloned into a 
pRc/RSV mammalian cell expression vector for ex- 
pression in human 293 cells. 

Modifications and variations of the method and com- 
positions described herein will be obvious to those 
skilled in the art from the foregoing detailed descrip- 
tion. Such modifications and variations are intended to 
come within the scope of the following claims. 



SEQUENCE USTING 



( I ) GENERAL INFORMATION: 

( i i i ) NUMBER OF SEQUENCES: 9 



( 2 ) INFORMATION FOR SEQ ID NO: I: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 amino acids 
( B ) TYPE: amino acid 
( C ) STRANDEDNESS: tingle 
(D)TOPOLOCY: linear 

( i i ) MOLECULE TYPE: peptide 

(lit ) HYPOTHETICAL: NO 

< i V ) ANTI-SENSE: NO 

( V ) FRAGMENT TYPE: internal 

( V 1 ) ORIGINAL SOURCE: 

( A ) ORGANISM: Mouse 

( i a ) FEATURE: 

( A ) NAME/KEY: Binding-site 
( B ) LOCATION: 1..12 

( D ) OTHER INFORMATION: /notes "Epitope recognized by HPC4 
antjprotein C antibody" 

( a i ) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GIu Asp Gin Val Asp Pro Arg Leu lie Asp Oly Lys 
I S 10 

( 2 ) INFORMATION FOR SEQ ID NO:2: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 672 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Unear 

( i i ) MOLECULE TYPE: cDNA 

( i i i ) HYPOTHETICAL: NO 

( i V ) ANTI-SENSE: NO 

( V i ) ORIGINAL SOURCE: 

( A ) ORGANISM: Homo sapiens 

( i a ) FEATURE: 

( A ) NAME/KEY: miscrecomb 
( B ) LOCATION: I..6 

( i X ) FEATURE: 

( A ) NAME/KEY: miscrecomb 
( B ) LOCATION: 67..72 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:2: 



CCCGGGACCA 


CCAATACTGT 


GGCAGCATAT 


AATTTAACTT 


GGAAATCAAC 


TAATTTCAAG 


6 0 


ACAATTTTGG 


AGTGGGAACC 


CAAACCCGTC 


AATCAAGTCT 


ACACTGTTCA 


AATAAOCACT 


1 2 0 


AAGTCAGGAG 


ATTGGAAAAG 


CAAATGCTTT 


TACACAACAG 


ACACAGAGTG 


TGACCTCACC 


1 S 0 


GACGAGATTG 


TGAAGGATGT 


GAAGCAGACG 


TACTTGGC AC 


GGGTCTTCTC 


CTACCCGGCA 


2 4 0 


GGGAATGTGG 


AGAGCACCGG 


TTCTGCTGGG 


GAGCCTCTGT 


ATGAGAACTC 


CCCAGAGTTC 


3 0 0 


ACACCTTACC 


TCCAGACAAA 


CCTCGGACAG 


CCAACAATTC 


AGAOTTTTGA 


ACAGGTGGGA 


3 6 0 


ACAAAAGTGA 


ATGTGACCGT 


AGAAGATGAA 


CGGACTTTAG 


TCAGAAGGAA 


CAACACTTTC 


4 2 0 


CTAAGCCTCC 


GGGATGTTTT 


TGGCAAGGAC 


TTAATTTATA 


CACTTTATTA 


TTGGAAATCT 


4 8 0 


TCAAGTTCAG 


GA AAGAAA AC 


AGCCAAAACA 


AACACTA ATG 


AGTTTTTGAT 


TGATGTGGAT 


3 4 0 


A AAGCAGAAA 


ACTACTGTTT 


CAGTGTTCAA 


GCAGTGATTC 


CCTCCCGAAC 


AGTTAACCGG 


6 0 0 


AAGAGTACAG 


ACAGCCCGGT 


AGAGTCTATG 


GGCCAGGAGA 


AAGGGGAATT 


T AGAGAATAA 


6 6 0 


CTGCAGTCTA 


GA 










6 7 2 



( 2 ) INFORMATION FOR SEQ ID NO:3: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 2i8 amino acids 
( B ) TYPE: amiiKt acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

( i t i ) HYPOTHETICAL: NO 

( i V ) ANTI-SENSE: NO 

( V ) FRAGMEKT TYPE: N-tcnninal 

( V } ) ORIGINAL SOURCE: 

( A ) ORGANISM: Homo sapiens 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Gly Tbr Tbr Asn Thr Val Ala Ala Tyr Asn Leu Thr Trp Lys Ser Thr 

1 5 10 15 

Asn Phe Lys Thr lie Leu Glu Trp GIu Pro Lys Pro Val Asn Gin Val 
2 0 2 5 3 0 

Tyr Thr Val Gin lie SerThr Lys Ser Gly Asp Trp Lys Ser Lys Cys 
3 5 4 0 4 5 

Phe Tyr Thr Thr Asp Thr Glu Cys Asp Leu Thr Asp Glu lie Val Lys 
SO S S 6 0 

Asp Val Lys Gin Thr Tyr Leu Ala Arg Val Phe Ser Tyr Pro Ala Gly 

63 70 75" 80 

Asn Val Glu Ser Thr Gly Ser Ala Gly Glu Pro Leu Tyr Glu Asn Ser 
8 5 9 0 9 5 

Pro Glu Phe Thr Pro Tyr Leu Glu Thr Asn Leu Gly Gin Pro Thr lie 
10 0 10 5 110 

Gin Ser Phe Glu Gin Val Gly Thr Lys Val Asn Val Thr Val Glu Asp 

115 12 0 12 5 

Glu Arg Thr Leu Val Arg Arg Asn Asn Thr Phe Leu Ser Leu Arg Asp 

1 3 0 1 3 5 1 4 0 

Val Phe Gly Lys Asp Leu lie Tyr Thr Leu Tyr Tyr Trp Lys Ser Ser 
14 5 1 5.0 15 5 16 0 

Ser Ser Gly Lys Lys Thr Ala Lys Thr Asn Thr Asn Glu Phe Leu lie 
16 5 17 0 17 5 

Asp Val Asp Lys Gly Glu Asn Tyr Cys Phe Ser Val Gin Ala Val lie 
18 0 18 5 19 0 

Pro Ser Arg Thr Val Asn Arg Lys Ser Thr Asp Ser Pro Val Glu Cys 

195 200 205 

Met Gly Gin Glu Lys Gly Glu Phe Arg Glu 

2 10 2 15 

( 2 ) INFORMATION FOR SEQ ID NO:4: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 160 base pairs 
( B > TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: cDNA 

( i i i ) HYPOTHETICAL: NO 

( i V ) ANTI-SENSE: NO 

( V i ) ORIGINAL SOURCE: 

( A ) ORGANISM: Escherichia coli 

( i X ) FEATURE: 

( A ) NAME/KEY: miscfcature 
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( B ) LOCATTON: ia.l3 

( D } OTHER INFORMATION: /Bote«>**SluDe-Dtl8»rm> sequence" 

( i k ) FEATURE: 

( A ) NAME/KEY: miscfeature 
( B ) LOCATION: 2Z..SS 

< D ) OTHER INFORMATION: /note«**pdB leider peptide** 

< i a ) FEATURE: 

( A ) NAME/KEY: miscfeature 
( B )LOCATION:3..M 

( D ) OTHER INFORMATION: /oote»'*HPC4 epitope" 

( i I )FEATURE: 

( A ) NAME/KEY: miscfeature 
( B ) LOCATION: 15.. 1 8 

( D ) OTHER INFORMATION: /note = "Factor Xa cleavage site" 

( i X ) FEATURE: 

( A ) NAME/KEY: miscfeature 
( B ) LOCATION: 146.. 152 

( D ) OTHER INFORMATION: /note = "Not! rcstiictioa enzyme" 
( a i ) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
CTAGATAACG AGGGCAAAAA ATGAAATACC TATTGCCTAC GCCAGCCGCT GGATTGTTAT 60 
TACTCGCTOC CCAACCAGCC ATGGCCGAAT TCGAAGATCA GGTAGATCCG CGGTTAATCG 120 
ATGGTAAGAT TOAAGGAAGG CCTAGGCGGC CGCAAGCTTG 160 

( 2 ) INFORMATION FOR SEQ ID NO:S: 

( t ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 160 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: cDNA 

( i i i ) HYPOTHETICAL: NO 

. ( t V ) ANTI-SENSE: NO 

( V i ) ORIGINAL SOURCE: 

( A ) ORGANISM: Escherichia coli 

( i X ) FEATURE: 

( A ) NAME/KEY: miscfeature 
( B ) LOCATION: 83..88 

( D ) OTHER INFORMATION: /notees**EcoRI restriction enzyme 
site" 

( i X ) FEATURE: 

( A ) NAME/KEY: miscfeature 
( B ) LOCATION: 1 34.. 139 

( D ) OTHER INFORMATION: /noteB**Siul restriction enzyme 
site" 

( i X ) FEATURE: 

< A ) NAME/KEY: miscfeature 
( B ) LOCATION: 150 . 155 

( D ) OTHER INFORMATION: /note = "Hindlll restriction enzyme" 
( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
TATTOCTCCC GTTTTTTACT TTATGGATAA CGGATGCCCT CGGCGACCTA ACAATAATGA 6 0 

GCGACGGGTT GGTCGGTACC GGCTTAAGCT TCTAOTCCAT CTAGGCGCCA ATTAOCTACC 120 
ATTCTAACTT CCTTCCGGAT CCGCCGGCOT TCGAACCTaG 1 6 0 

( 2 ) INFORMATION FOR SEQ ID NO:6: 

< i- ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 4 amino acids 
( B ) TYPE: amino acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: peptide 

( i i i ) KYPOTHETICAL: NO 

( i V ) ANTl-SENSE: NO 

( i X ) FEATURE: 

( A ) NAME/KEY: Ck»v««e-«te 
( B ) LOCATION: 1..4 

( D ) OTHER INFORMATION: /nole« "Factor X« aeavage Site" 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

lie GIu Cly Arg 
I 



( 2 ) INFORMATION FOR SEQ ID NO:7: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: S anuBO acids 
( B ) TYPE: amino acid 
( C ) STRANDEDNESS: siasle 

( D ) TOPOLOGY; linear 

( i i ) MOLECULE TYPE: peptide 

( i i i ) HYPOTHETICAL: NO 

( i V ) ANTLSENSE: NO 

( i X ) FEATURE: 

( A ) NAME/KEY: Cleavage-site 
( B ) LOCATION: I. .5 

( D ) OTHER INFORMATION: /aote=r"Enterokina$e Oeava^e Site*' 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Asp Asp Asp Asp Lys 
I 5 



( 2 ) INFORMATION FOR SEQ ID NO:8: 

< i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 3 imino acids 
( B > TYPE: amino acid 
( C > STRANDEDN-ESS: single 
( D ) TOPOLOGY: Unear 

( i i ) MOLECULE TYPE: peptide 

( i i i ) HYPOTHETICAL: NO 

( i V ) ANTI-SENSE: NO 

( i X ) FEATURE: 

( A ) NAME/KEY: Cleavage-site 
( B ) LOCATION: 1..3 

( D ) OTHER INFORMATION: /note » "Thrombin Cleavage Site" 
(XI) SEQUENCE DESCRIPTION: SEQ ID NO:8: 
Phe Pro Arg 



( 2 ) INFORMATION FOR SEQ ID NO:9: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 3 amino acids 
< B ) TYPE: amino acid 
. ( C ) STRANDEDNESS: single 
( D > TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

( i i i ) HYPOTHETICAL: NO 



( i V ) ANTI-SENSE: NO 
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( t X ) FEATURE: 

( A ) NAME/KEY: Oeavage^e 
( B ) LOCATION: 1..3 

( D ) OTHER INFORMAI ION: /ootea^Tluonibia Qeavage Site* 

( X i ) SEQUENCE OESCRIITION: SEQ ID NO:9: 

Gly Pro Arg 
1 



We claim: 

1. A fusion protein comprising an epitope which is 
capable of binding in the presence of calcium to the 
monoclonal antibody designated as HPC-4 (ATCC No. 15 
HB 9892) and a protein to be isolated by binding of the 
fusion protein to HPC-4 antibody, wherein the epitope 
and the protein to be isolated are separated by a specific 
protease cleavage site. 

2. The fusion protein of claim 1 wherein the epitope 20 
consists of the amino acid sequence EDQVDPRL 

I D G K (SEQ ID NO: 1). 

3. The fusion protein of claim 1 wherein the protease 
cleavage site is selected from the group consisting of 
amino acid sequences specifically cleaved by Factor 25 
Xa; He Glu Gly Arg (lEGR. SEQ ID NO; 6), enteroki- 
nase: Asp Asp Asp Asp Lys (DDDDK, SEQ ID NO: 



7), and thrombin: Phe/Gly Pro Arg (F/GPR, SEQ ID 
NO: 8 and SEQ ID NO: 9). 

4. The fusion protein of claim 1 wherein the protein 
to be isolated is selected from the group consisting of 
tissue factor and a protein corresponding to the fourth 
through sixth epidermal growth factor domains of 
thrombomodulin. 

5. The fusion protein of claim 1 consisting essentially 
of the amino acid sequence EDQVDPRLIDGK 
(SEQ ID NO: 1) at the N-terminus, an amino acid se- 
quence specifically cleaved by factor Xa, and tissue 
factor. 

6. The fusion protein of claim 5 wherein the tissue 

factor is the extracellular domain of tissue factor. 
• • • • • 
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Recombinant murine MRP 14 {mMRP14) was pro- 
duced in Escherichia coli using the pGEX expression 
system. The mass of fusion protein, by electrospray 
ionization-mass spectrometry (ESI/MS), was 39,213 Da 
which compares well with the theoretical mass 
(39,210.4 Da), Thrombin digestion of fusion protein 
was expected at a cloned thrombin consensus se- 
quence ( . . . LVPRGS . . . ) located between glutathione 
S-transferase and mMRP14. Analysis of products of 
digestion by C4 reverse-phase HPLC and SDS-PAGE/ 
Western blotting revealed two immunoreactive cleav- 
age products with molecular weights around 13,000. 
Masses of the two proteins determined by ESI/MS 
were 13,062 and 11,919 Da. The larger product corre- 
sponded to the expected mass of recombinant 
mMRP14 (13,061.9 Da). Analysis of the protein se- 
quence of recombinant mMRP14 revealed a thrombin- 
like consensus sequence ( . . . NNPRGH . . . ) located 
close to the C-terminus. The smaller protein corre- 
sponded to a truncated form of rec mMRP14 (rec 
MRP14,_,o2) with a calculated mass of 11,918.6 Da. Op- 
timization of the cleavage conditions resulted in >95% 
full-length rec mMRP14. Native mMRP14 contains one 
intramolecular disulfide bond between Cysyg and 
Cysgo. The full-length recombinant protein was rena- 
tured and oxidized in ammonium acetate (pH ~ 7) for 
96 h and formed >95% of the native intramolecular 
disulfide-bonded form. MRP14,_io2 bound substantially 
less ®^Zn^* compared to native mMRP14 or rec 
mMRP14 after transfer to polyvinylidene difluoride 
and incubation with "ZnCU, implicating the His resi- 
dues located within the C-terminal domain in Zn^"^ 
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MRP 14 [migration inhibitory factor (MIF) -related 
protein, 14 kOa], also known as pi 4, LI heavy chain, 
calgranulin B, and S100A9 (1), is a small acidic protein 
containing two Ca^^-binding EF hands belonging to the 
highly conserved SlOO protein family (2,3). Some mem- 
bers also bind zinc with high affinity (1). They may 
influence diverse cellular processes including cell cycle 
progression, cell differentiation, regulation of protein 
phosphorylation, and cytoskeletal-membrane interac- 
tions (4,5). The C-terminal region of some members of 
the SlOO family is extended. A peptide with sequence 
identity to the C-terminal domain of human MRP 14 
(hMRP14) has potential function (6). This region is 
identical to part of the sequence of neutrophil-immobi- 
lizing factor (NIF-1) which inhibits neutrophil chemo- 
taxis to C5a and fMetLeuPro (FMLP) (6,7), but the 
precise role of hMRP14 in leukocyte migration is un- 
clear. Recent experiments indicate a role in leukocyte 
adhesion (8). Human MRP 14 has antimicrobial activ- 
ity, possibly caused by its ability to sequester Zn^^ from 
the surrounding environment (9,10). A potential zinc- 
binding domain, located near the (i-terminus of 
hMRP14. was identified (11). 

We previously isolated CP 10 (chemotactic protein, 
10 kOa) and mMRP14 from supernatants of activated 
murine spleen cells (12,13). ESI/MS and Edman se- 
quencing indicated that CP 10 was not posttranslation- 
ally modified, whereas mMRP14 had several modifica- 
tions including a disulfide bond between Cysyg and 
Cysgo- The protein contains an additional free Cys res- 
idue at position 110 (13). Functional recombinant 
CP 10 was produced using the pGEX expression system 
(14,15). 

Formation of disulfide bonds in numerous recombi- 
nant proteins at identical Cys residues to those iden- 
tified within the native protein is essential for biologi- 
cal activity (16-18). The importance of disulfide bonds 
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for functional activity in SI 00 proteins is indicated for 
SlOOp, where mitogenic activity is dependent on a 
specific intramolecular disulfide bond between Cyses 
and Cys84 (19), 

To facilitate biochemical, structural, and functional 
characterization, a large-scale source of mMRPI4 was 
obtained using the pGEX recombinant protein expres- 
sion system. Here we describe the isolation and char- 
acterization of two forms of the protein and the oxida- 
tive refolding to produce recombinant mMRPI4 with 
the intramolecular disulfide bridge in the same posi- 
tion as in the native protein. A comparison of the 
affinity of native mMRP14, rec mMRP14, and rec 
MRP14i_io2 for Zn^^ was also performed. 

MATERIALS AND METHODS 

General. Reagents and chemicals were analytical 
grade (Sigma, St. Louis, MO; Bio-Rad, Hercules. CA) 
and solvents were HPLC grade (Mallinckrodt, Clayton 
South, Victoria, Australia), SDS-PAGEAA/estern blot- 
ting were performed using a Mini Protean II apparatus 
(Bio-Rad) with 15% gels and a Tris/Tricine buffer sys- 
tem (20). Native mMRP14 was obtained from activated 
spleen cell supernatants as previously described (13). 
Liquid chromatographic separations were performed 
using a nonmetallic LC626 HPLC system (Waters, 
Bedford, MA) and UV absorbance was monitored at 
214 and 280 nm with a Waters 996 photodiode array 
detector or 490 UV/visible detector. 

Overexpression and characterization of recombinant 
mMRP14. The coding sequence for mMRP14 (21) was 
amplified by PCR from a cDNA plasmid clone, pm- 
MRP14, and cloned between BaniHl and EcoRl sites of 
the glutathione S-transferase expression vector 
pGEX2T according to the manufacturer's instructions 
(Pharmacia Biotech, Uppsala, Sweden). The mMRP14 
expression vector, pLC14, was used to transform com- 
petent E. caii (strain BL-21). Transformed bacteria 
were cultured with IPTG (100 /jlM) for 3-4 h to induce 
fusion protein production (14,15). Recombinant 
mMRP14 was isolated after cell lysis (3X freeze/thaw) 
in TBS/T (25 mM Tris, 250 mM NaCl, pH 7.5, 1% 
Triton X-100) using glutathione-agarose beads (Sigma) 
and cleaved with thrombin (Calbiochem, San Diego. 
CA; 20 NIH units/liter culture) in TBS (4 ml, 60 min, 
37°C). Beads were washed with TBS (2X2 ml), the 
combined eluate (500 /xl) was applied to a C4 RP-HPLC 
column (5 /xm, 300 A, Vydac, Hesperia, CA), and pro- 
teins were eluted in a linear gradient of 25 to 70% 
actetonitrile (0,1% TFA) at 1 ml/min over 30 min. Pro- 
teins were collected manually and concentrated using a 
Speed Vac (Savant, Farmingdale, NY) to a final concen- 
tration of approximately 50 ngZ/xl. Samples (approxi- 
mately 200 ng) were analyzed by SDS-PAGE; gels 
were blotted onto polyvinylidene difluoride (Immobilon 
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P. Millipore, Bedford, MA) at 75 V for 25 min before 
Western analysis. Semipreparative RP-HPLC was per- 
formed after reduction (5 mM DTT) and filtration (2 
/xm, Durapore, Millipore). The lysate (4 ml) was loaded 
onto a C8 RP-HPLC column (Prep- 10. 20 /xm, 10 X 100 
mm, Brownlee/ABI, Alltech, XL) and rec mMRP14 was 
isolated after application of a 35 to 50% acetonitrile 
(0.1% TFA) gradient over 20 min at 3 ml/min. Recom- 
binant MRP14,_io2 was isolated after prolonged (24 h) 
exposure to thrombin (Calbiochem; 50 NIH units/liter 
culture) in TBS (4 ml) at 37X. The beads were washed 
with TBS (2X1 ml) and combined and then the trun- 
cated protein was isolated as above. 

Refolding of recombinant mMRP14, Recombinant 
mMRP14 (—250 /xg, 3 ml) isolated by preparative C8 
RP-HPLC was reduced to approximately 50% of the 
initial volume and then dialyzed for 96 h against am- 
monium acetate (5 mM, pH 7, 2 liters) using 2000 
MWCO Slide-a-Lyser dialysis cassettes (Pierce, Rock- 
ford, IL). The extent of refolding was assessed by C4 
RP-HPLC (5 /utm, 4,6 X 250 mm, Vydac) using a linear 
gradient of 25 to 70% acetonitrile (0.1% TFA) over 30 
min at 1 ml/min at ~24-h intervals. Samples collected 
from C4 RP-HPLC were analyzed by SDS-PAGE and 
ESI/MS, 

CarboxyamidonietJiyiation of MRP 14. MRP 14 (5 
nmol) either native, reduced, or oxidized recombinant 
isolated by C4 RP-HPLC were treated with iodoacet- 
amide (19 pig, 100 nmol) in ammonium bicarbonate 
(100 mM, 250 ^1) in the presence of guanidine-HCl (4 
M) for 30 min at 30X. After acidification (1% TFA). the 
mixture was loaded onto the C4 RP column and pro- 
teins were eluted with a gradient of 25 to 70% aceto- 
nitrile (0.1% TFA) at 1 ml/min over 30 min. The deri- 
vatized proteins were collected manually. 

Mass spectrometry. ESI mass spectra were ac- 
quired using a single-quadrupole mass spectrometer 
equipped with an electrospray ionization source (Plat- 
form, VG-Fisons Instruments, Manchester, UK). Sam- 
ples (—50 pmol, 10 /a1) were injected into a moving 
solvent (10 /xl/min; 50:50 water: acetonitrile, 0.05% 
TFA) using a Phoenix 40 HPLC pump (VG-Fisons In- 
struments) coupled directly to the ionization source via 
a fused silica capillary (50 ;xm X 40 cm). The source 
temperature was 50°C and nitrogen was used as the 
nebulizer and drying gas. Sample droplets were ionized 
at a positive potential of approximately 3 kV and trans- 
ferred to the mass analyzer with a cone voltage (sample 
cone to skimmer lens voltage) of 50 or 70 V. The peak 
width at half height was 1 Da. Spectra were acquired 
in multichannel acquisition mode over the mass range 
700 to 1800 Da in 5 s and then calibrated with horse 
heart myoglobin (Sigma). 

Peptide mapping. Proteins (50 /xg) isolated from C4 
RP-HPLC were digested in ammonium bicarbonate 
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(250 /xl, 50 mM, pH 8.0) using endoprotease Glu C 
(sequencing grade, Boehringer-Mannheim, Castle Hill, 
NSW) at an enzyme to substrate ratio of approximately 
1: 100 at 37°C for 6 h. The pH of the digest was lowered 
to approximately 2 (1% TFA) and the mixture was 
applied directly to a C18 RP column (Waters, Novapak, 
60 A, 4 /xm, 150 X 3.9 mm). Peptides were eluted with 
a gradient of 5 to 75% acetonitrile (0.1% TFA) at 1 
ml/min over 30 min. Peptides were collected manually 
and lyophilized to dryness, and wateriacetonitrile (1:1, 
50 fxl) was then added before ESI/MS analysis. 

Zinc overlay. Native mMRP14, rec mMRP14, and 
rec mMRP14|_io2 (-^500 ng) were separated by SDS- 
PAGE. Proteins were transferred to PVDF at 75 V 
for 25 min at 4°C. After transfer, membranes were 
incubated in TBS or TBS + CaClz (1 mM; 15 ml) 
containing ^^Zn^^ (25 /iCi, 10 mCi/mmol, Amersham) 
and solutions were agitated for 30 min. Membranes 
were washed with TBS (3 X 15 ml) for 5 min and 
radioactivity was located using a phosphorimager 
and BI screen (GS-525, Bio-Rad). Images were ana- 
lyzed using the Multianalyst program (Bio-Rad). 
Proteins were quantitated using a densitometer 
(Bio-Rad) after staining membranes with Amido 
black (Bio-Rad) according to the manufacturer's pro- 
cedure. 

RESULTS AND DISCUSSION 

The pGEX expression system produces soluble re- 
combinant proteins as a fusion with glutathione 5- 
transferase (GST), enabling isolation from bacterial 
lysates by affinity chromatography under nondenatur- 
ing conditions. The fusion protein is digested at a 
cloned consensus site with either thrombin or factor Xa 
producing GST and recombinant protein with two ad- 
ditional amino acids (GlySer) at the N-terminus (22). 
Bacterial recombinant proteins are generally produced 
without disulfide bonds and must be refolded/oxidized 
to achieve native structures (23). 

ESI/MS is a rapid and precise method for determin- 
ing the mass of proteins and peptides. It can validate 
protein sequences (24,25) and has been used to char- 
acterize many recombinant proteins, including the 
SlOO proteins calvasculin (26), calcyclin (26), and 
SI 00 A3 (27). In these examples the proteins were ex- 
pressed in E. coJJ as full-length proteins, the identity of 
each confirmed by ESI/MS after comparison of theoret- 
ical and experimental masses. No mutant or posttrans- 
lationally modified forms were identified. Recombinant 
CP 10 was produced using the pGEX expression system 
and a mutant form containing an additional 10 C- 
terminal amino acids isolated and characterized using 
ESI/MS (15). 



TABLE 1 

ESI Masses of Recombinant Proteins and Calculated 
Masses of Proteins Derived from Proposed 
cDNA Sequences 



Protein 


C4 retention 
time (min) 


Calculated^ 
cDNA mass 


ESI mass (Da) 


Fusion protein 


22.6 


39.210.4 


39.312" 


GST 


22.5 


26.166.6 


26.164* 


MRP 14, .,02 


21.1 


11.918.6 


11.919 


MRP14,«,_d 


20.5 


13.061.9 


13,062" 


MRP14o.,d.,cd 


19.4 


13.059.9 


13.06r 



" Masses were calculated from protein sequences derived from 
previously determined cDNA sequences. 

* A small peak (—20%) 131 Da less was also observed, possibly due 
to removal of the initiator Met. 

" Differentiation between reduced and oxidized forms is not possi- 
ble because the mass difference corresponds to the approximate 
mass error of ±2 Da. 

Isolation and ESI/MS of Recombinant mMRPl4 

The coding nucleotide sequence of the rec mMRP14 
expression plasmid was verified by automated nucleo- 
tide sequencing (not shown). After bacterial lysis and 
isolation of the fusion protein using glutathione-aga- 
rose, the protein was eluted with excess glutathione 
and isolated after C4 RP-HPLC. The mass of fusion 
protein by ESI/MS was 39,213 Da, which compares 
favorably with the theoretical mass calculated from the 
fusion protein sequence derived from the constructed 
GST-mMRP14 cDNA (Table 1). The minor peak of 131 
Da less than the full-length fusion protein was proba- 
bly due to modification of the fusion protein by removal 
of the initiator Met (Fig. lA). Glutathione 5- trans- 
ferase isolated after thrombin cleavage of fusion pro- 
tein immobilized on GSH-agarose for 24 h had a mass 
of 26,164 Da (Fig. IB) which also compared favorably 
with the theoretical mass (Table 1). A protein with a 
mass of 131 Da less than GST was also resolved (Fig. 
IB) indicating a modification within the N-terminal 
portion of the fusion protein and supporting the pro- 
posal that the initiating Met had been removed. 

After thrombin cleavage for 3 h, two proteins with 
retention times of 19.4 and 21.1 min were observed by 
analytical C4 RP-HPLC. SDS-PAGE indicated that 
both had molecular weights of ~ 13,000 Da and were 
immunoreactive with polyclonal antibodies to native 
mMRP14 by Western blotting (not shown). Only a sin- 
gle peak at 21.1 min was observed after prolonged 
cleavage [T 24 h), whereas digestion for 1 h yielded 
essentially one component that eluted at 19,4 min. The 
mass of the early eluting peak was 13,062 Da (Fig. 2 A) 
which corresponds to the expected theoretical mass of 
rec mMRP14 (Table 1) calculated from the derived 
protein sequence. The mass of the later-eluting protein 
was 11,919 Da (Fig. 2B). Analysis of the amino acid 
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FIG. 1. ESI mass spectra showing multiply charged protein ions of 
rec mMRP14 fusion protein (A) and GST derived from thrombin 
cleavage of fusion protein (B). Deconvoluted masses are shown in 
insets. 



sequence indicated that this corresponded to the ex- 
pected mass of rec mMRP14,_io2. The amino acid se- 
quence ( . . . KLHENNPRGHGH . . . ) contains a 
thrombin-like consensus sequence, NNPRGH, indicat- 
ing that the truncated form was produced after diges- 
tion at this site. Optimum cleavage sites for thrombin 
have structures P4-P3-Pro-Arg-Pl '-P2', where P3 and 
P4 are hydrophobic amino acids and P'l and P'2 are 
nonacidic amino acids or P2-Arg-Pl', where P2 and PI' 
are Gly (28). Both the engineered and native cleavage 
sites within the mMRP14 fusion protein contain the 
Pro-Arg-Gly consensus sequence. The difference in 
rate may be attributed to nonhydrophobic residues 
(Asn-Asn) located in the P4 and P3 positions of site 2, 



whereas site 1 contains the hydrophobic amino acids 
Leu-Val at P4 and P3, which is optimal for cleavage. 
Almost quantitative digestion of fusion protein was 
achieved after 1 h to yield 1.5 mg of >95% full-length 
recMRPH. Semipreparative C8 RP-HPLC removed re- 
sidual MRP14i.,o2. 

Oxidative Refolding of rec mMRP14 

Recombinant proteins produced in E. coli are often 
incorrectly folded, without appropriate disulfide bond 
formation, and normally form insoluble aggregates (in- 
clusion bodies) during high levels of transcription (17). 
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FIG. 2. ESI mass spectra showing multiply charged protein ions of 
the two forms of rec MRP 14 isolated after C4 RP-HPLC. Full-length 
rec mMRP14 (A) and mMRP14i_io2 (B) derived from thrombin cleav- 
age of fusion protein for 1 or 24 h. respectively. Deconvoluted masses 
are shown in insets. 
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Soluble recombinant proteins without disulfide bonds 
are also produced. Conditions for formation of native 
structures must be determined empirically for each 
protein; however, renaturation and oxidation may oc- 
cur concomitantly with formation of specific disulfide 
bonds following solubilization/denaturation and rena- 
turation (17.29). Generally, correctly folded native 
structures form in vitro after oxidative refolding follow- 
ing solubilization in urea or guanidine-HCl followed by 
dilution or dialysis. Proteins do undergo spontaneous 
oxidation in air after dilution or dialysis to yield in- 
tramolecular disulfide bonds. The rate of oxidation can 
be assisted by addition of mixtures of oxidized and 
reduced low-molecular-weight thiols (e.g. ,GSSG/GSH 
or cysteine/cystine) (29). Cu^^, salts, detergents, and 
elevated temperatures can also facilitate refolding/di- 
sulfide bond formation (17). These reagents allow ac- 
celerated formation/interconversion of unstable folded 
intermediates and subsequent formation of thermody- 
namically stable disulfide bridged products, which nor- 
mally represent biologically active conformations (29). 

Native mMRP14 has three Cys residues near the 
C-terminus at positions 79, 90, and 110. We showed 
previously that native mMRP14 (isolated from acti- 
vated spleen cell supernatants) contains a specific in- 
tramolecular disulfide bond located between Cys 79 and 
Cysgo (13). Due to changes in relative hydrophobicity, 
the C4 RP-HPLC retention times of native and reduced 
mMRP14 differed by —60 s facilitating characteriza- 
tion of the disulfide bonded form (13). Murine MRP 14 
is a cytosolic neutrophil protein and would be expected 
to be in a reduced form intracellularly. The disulfide- 
bridged form may be the preferred extracellular struc- 
ture and may represent the functional form. Other 
SlOO proteins are physiologically activated by oxida- 
tion. Oxidized SI 00)3, a glial cell growth factor, con- 
tains an intramolecular disulfide bond between Cyses 
and Cys84 and is the most potent S 100/3 species to 
stimulate C6 glial cell proliferation (19). Oxidation of 
this protein also causes a conformational change which 
allows phosphorylation of a canonical CKII-a site lo- 
cated within the C-terminal Ca^^ binding domain, 
which may be functionally significant (19). 

Recombinant mMRP14 was refolded and oxidized by 
prolonged dialysis at neutral pH. Figure 3 shows the 
time course of oxidative refolding of rec mMRP14 an- 
alyzed by C4 RP-HPLC. After 24 h, three different 
forms were identified, corresponding to oxidized 
mMRP14 (Fig. 3B, peak 1), the fully reduced protein 
(Fig. 3B, peak 2), and a transient folding intermediate 
(Fig. 3B, peak 3) of unknown structure. Analysis by 
SDS-PAGE, with silver staining and Western blotting, 
and ESI/MS indicated that the latter contained mix- 
tures of monomeric and dimeric mMRP14 (not shown). 
The intensities of peaks 2 and 3 were reduced by pro- 
longed dialysis until at 96 h the oxidized form (peak 1) 
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FIG. 3. RP-HPLC trace of full-length rec mMRP14 with time- 
dependent conversion of reduced rec MRP 14 (peak 2) to oxidized rec 
MRP 14 containing a disulfide bond between Cysyg and Cys^ (peak 1). 
Proteins were dialyzed in ammonium acetate. pH — 7, for (A) T — 
0 h, (B) r = 24 h, (C) r= 48 h. and (D) r= 96 h. The identity of peak 
3 is unknown (see text for details). 



represented the major structural form (Fig. 3D). The 
yield of peak 1 did not increase with more prolonged 
dialysis. The mass of peak 1 was identical to that 
expected for oxidized MRP 14 (13061 Da) and the yield 
after semipreparative C8 RP-HPLC was —55%. The 
results from each step are summarized in Table 2. The 
mass of the iodoacetamide derivative of oxidized rec 
mMRP14 was 13,117 Da. This corresponds to deriva- 
tization of a single free sulfhydryl (+57 Da) indicating 
that oxidized rec MRP 14 contains one internal disul- 
fide bond. 
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TABLE 2 

Summary of the Expression, Digestion, and 
Isolation of rec MRP 14 



Step 


Protein 


Total protein 
(mg) 


Yield 
(%) 


Culture/liter 


GST-MRP 14 


4.7 




Thrombin (1 h) 


MRP 14 


1.5 


95 


Thrombin (14 h) 


MRP14,.,o2 


1.4 


95 


Oxidation (96 h) 


MRPl4,^di,ed 


0.8 


55 



The three Cys residues in mMRP14 can potentially 
form three distinct intramolecular disulfide-bonded 
isomeric forms, CysTg-Cysgo, Cysgo-CySno, and Cysyg- 
CysiiQ. Endoprotease Glu C digest products of native 
mMRP14 were characterized by ESI/MS and Edman 
sequencing after separating peptides by C18 RP-HPLC 
(13). The peptide mMRP 14 79.92 was isolated and con- 
tained an intramolecular disulfide bond (13). The in- 
tramolecular disulfide bond produced after oxidative 
refolding of rec mMRPH was located by comparing 
C18 RP-HPLC traces of Glu C digest products of native 
mMRPH, rec mMRP14, and oxidized rec mMRP14. 
Apart from slight differences in relative intensities, 
essentially the same chromatographic profiles were ob- 
tained for each protein and a peptide corresponding to 
MRP 14 

79-92 (rt — 18.3 min) was isolated from each 
digest. The mass of these peptides was 1640 ±2 Da and 
corresponded to the predicted mass of MRP 14 79.92 
(1641.1 Da reduced or 1639.1 Da oxidized). No peptides 
corresponding to predicted fragments with disulfide 
bonds to Cys no were identified, supporting disulfide 
bond formation between the same Cys residues in na- 
tive mMRP14 and oxidized rec mMRP14. A peptide 
corresponding to MRP 14 79.92 (ESI mass 1640 Da) was 
also isolated from Glu C digestion of the iodoacetamide 
derivative of oxidized rec mMRP14. further supporting 
the location of the disulfide bond. 

Zinc Binding 

SlOO proteins commonly bind Ca^^ and Zn^^ (30,31). 
S 100/3 binds eight Zn^^ ions, four with high affinity and 
four with low affinity, probably via His residues, but 
the precise binding sites are uncharacterized (30). Ca^^ 
antagonizes low-affinity Zn^^ binding due to a confor- 
mational change (30). These properties may explain 
the multiple interactions and different extracellular 
and intracellular targets of SlOO j3 in the brain (5). Both 
human and murine MRP 14 bind substantially more 
Zn"^"^ than either S100A8 (CPIO) or SlOOjS (13). 

Equal quantities of native mMRP14, rec mMRP14, 
and rec mMRP14,_,02 were separated on SDS-PAGE, 
blotted onto PVDF, and incubated with radioactive 
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•^^ZnClz. Native mMRP14, containing the disulfide 
bond between Cysyg and Cysgo, and reduced rec 
MRP 14 bound approximately equal (±5%) amounts 
of ^^Zn^^ (Fig- 4, lanes 1 and 2, respectively) indicat- 
ing that the Cys residues are not involved in Zn^^ 
binding, whereas binding was reduced by 80% (±5%) 
with similar quantities of rec mMRP14|_io2 (Fig. 4, 
lane 3). All ®^Zn^^ binding was inhibited by excess 
ZnClz (not shown), although excess CaCl2 did not 
inhibit Zn^^ binding indicating that the binding site 
was distinct and independent from the two Ca^^ 
binding domains (Fig. 4, lanes 4, 5, and 6). Minor 
bands (<5%) at approximately 28,000 were attrib- 
uted to disulfide-bonded dimeric forms of MRP 14 
(Fig. 4). Overlay experiments suggest that full- 
length mMRP14 has substantially higher affinity for 
Zn^"^ than mMRP14,_,02 (Fig. 4). Analysis of the 
amino acid sequence of mMRP14 does not indicate 
putative "zinc fingers" (32) or clusters (4-12) of 
acidic amino acids (33) which potentially may bind 
Zn^^, but the protein contains seven His and an 
unmodified Cys, some of which may adopt an appro- 
priate conformation for binding (34). Zinc binding 
has also been proposed to occur at His residues in the 
sequence HEXXH in hMRP14 and may be implicated 
in its ability to inhibit antimicrobial growth (11). 
mMRP14 has two such motifs, located at either side 
of the putative thrombin cleavage site, and both 
could potentially bind Zn^^. The C-terminal peptide 
generated by thrombin digestion has the sequence 
GHGHSHGKGCGK. The three His and one Cys res- 
idue could also potentially chelate Zn^^ indepen- 
dently or with other residues located within 
mMRP14. Additional experiments are required to 
define the precise Zn^^ binding sites of mMRP14, but 
experiments reported here confirm the involvement 
of the extreme C-terminal motif in Zn^"^ chelation. 

In conclusion, we have characterized two forms of 
rec mMRP14, the full-length protein and a truncated 
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FIG. 4. ^^Zn^^ binding after transfer of proteins separated by SDS- 
PAGE to PVDF. native mMRPl4 (lane 1). rec mMRPI4 (lane 2). and 
rec MRP14,.,o2 (lane 3) in TBS and binding to native niMRP14 (lane 
4), rec mMRP14 (lane 5), and rec MRPHnoz (lane 6) in TBS + CaClz 
(1 niM). 
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form, MRP14i_io2. produced by thrombin cleavage at 
an alternate site. Interestingly, MRP 14 released by 
necrotic neutrophils at sites of inflammation (35) 
would be located in regions of active fibrin formation 
and the susceptibility of this site to thrombin cleav- 
age may have functional relevance. Oxidative refold- 
ing of the recombinant protein readily occurred at 
pH ~ 7 in ammonium acetate buffer yielding a pro- 
tein with intramolecular disulfide bonds equivalent 
to those found in the native protein. This will allow 
more detailed structural characterization of 
mMRP14 and more reliable functional analysis. Zn^^ 
binding by the truncated form of mMRP14 was 
greatly reduced, suggesting that residues in this do- 
main are important in Zn^^ binding to mMRP14. The 
serendipitous identification of a thrombin cleavage 
product with reduced capacity to bind Zn^^ may have 
physiological relevance. 
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The nitrogen assimilation control protein (NAC) from Klebsiella aerogenes or Escherichia coli (NACk or 
NACq, respectively) is a transcriptional regulator that is both necessary and sufficient to activate transcription 
of the histidine utilization (hut) operon and to repress transcription of the glutamate dehydrogenase (gdh) 
operon in K. aerogenes. Truncated NAC polypeptides, generated by the introduction of stop codons within the 
nac open reading frame, were tested for the ability to activate hut and repress gdh in vivo. Most of the NACk 
and NACe fragments with 100 or more amino acids (wild-type NAC^ and NAC^ both have 305 amino acids) 
were functional in activating hut and repressing gdh expression in vivo. Full-length NAC^ and NACe were 
isolated as chimeric proteins with the maltose-binding protein (MBP). NACk and NACe released from such 
chimeras were able to activate hut transcription in a purihed system in vitro, as were NACk129 and NACglOO 
(a NACk fragment of 129 amino acids and a NACe fragment of 100 amino acids) released from comparable 
chimeras. A set of NACe NACk fragments carrying nickel-binding histidine tags (his^ at their C termini 
were also generated. All such constructs derived from NACe were insoluble, as was NACe itself. Of the 
his^-tagged constructs derived from NACk, NACkIOO was inactive, but NACk120 was active. Several NAC 
fragments were tested for dimerization. NACKl20-his6 and NACKlOO-his^ were dimers in solution. MBP-NACk 
and MBP-NACk129 were monomers in solution but dimerized when the MBP was released by cleavage with 
factor Xa. MBP-NACe was readily cleaved by factor Xa, but the resulting NACe was also degraded by the 
protease. However, MBP-NACE-his6 was completely resistant to cleavage by factor Xa, suggesting an interac- 
tion between the C and N termini of this protein. 



The nitrogen regulatory (Ntr) system of enteric bacteria 
allows Klebsiella aerogenes, Escherichia coli, and other enteric 
bacteria to respond rapidly and effectively to changes in the 
quality of the nitrogen source provided (13). Under conditions 
of nitrogen limitation, a complex cascade of regulatory events 
involving uridylylation and phosphorylation reactions leads to 
the accumulation of the phosphorylated (active) form of the 
transcriptional activator NtrC. NtrC-phosphate can activate 
the a^'^-dependent expression of a variety of genes involved in 
utilization of organic and inorganic nitrogen sources. NtrC- 
phosphate also activates transcription of the nac gene in K 
aerogenes and E. coli (6). 

The nitrogen assimilation control protein (NAC) is a regu- 
latory protein responsible for activating the transcription of 
operons such as hutUH,putP, and ureDABCEFG, whose prod- 
ucts supply the cell with ammonia or glutamate from histidine, 
proline, and urea, respectively (2). NAC is also responsible for 
repressing transcription of the gdhA and gltBD operons, whose 
products are involved in assimilating ammonia under condi- 
tions of nitrogen excess or limitation (2). NAC is also respon- 
sible for down-regulating its own transcription (3, 7). The ex- 
pression of the nac gene is entirely dependent on the Ntr 
system such that under conditions of nitrogen-limited growth, 
the Ntr system is active and nac is actively transcribed by RNA 
polymerase charged with the minor sigma factor (6, 12). 
There are no known coeffectors involved in the regulation of 
transcription by NAC (17), so if nac is expressed, then NAC is 
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produced and activates transcription of hut, put, and ure by 
RNA polymerase charged with the major sigma factor o- . 
Thus, NAC serves as a coupling factor between the o-^'*-depen- 
dent Ntr system and o-^°-dependent genes like hut (2). 

NAC is a member of the large family of LysR-type transcrip- 
tional regulators (LTTRs; 15, 16). This group of over 50 pro- 
teins are identified on the basis of amino acid sequence simi- 
larity. Most of the members of this family have between 300 
and 350 amino acids and show a surprisingly high degree of 
sequence similarity. For example, NAC from K aerogenes 
shows about 40% sequence identity with OxyR from E. coli 
(16). Within the LTTRs, the DNA-binding domain occurs near 
the N terminus of the protein, where a predicted helix-tum- 
helix motif (approximately amino acids 20 to 40 in NAC) is 
located (15). In several LTTRs, the binding site for a required 
coeffector has been identified. These binding sites are gener- 
ally in the C-terminal third of the polypeptide, suggesting that 
the DNA-binding and coeffector-binding domains are physi- 
cally separated (15). 

Very little structural information about LTTRs is available, 
A fragment of CysB containing the C-terminal 233 amino acids 
has been crystallized, and a structure was derived from X-ray 
diffraction data (18). However, this fragment lacks the N-ter- 
minal 87 amino acids and thus the DNA-binding domain. As a 
result, this structure says little about how the DNA-protein 
interactions occur, are altered by the presence of inducer, or 
result in the activation of transcription. 

Comparison of NAC from E. coli (NAC^) with NAC from 
IC aerogenes (NAC^) revealed a surprising lack of sequence 
similarity (14). NACg was only about 75% identical to NACk, 
in contrast to other E. coli and K aerogenes regulatory proteins, 
which are >95% identical. Most of the sequence divergence 
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TABLE 1. Strains and plasmids used in this study 



Strain or 
plasmid 



Description 



Source or 
reference 



KC2668 hutC5J5 ^[bla]-2 nac* 

KC2725 hinC5I5 A[6/a]-2 «flc-205::Tn5-/37 

K:C2972 KC2941/pCB51 1 

KC3220 KC2725/pCB594 

KC3555 KC2725/pCB771 

KC3568 KC2725/pCB774 

KC3572 KC2725/pCB778 

KC3578 KC2725/pCB781 

KC3580 KC2725/pCB782 

KC359 1 KC2725/pCB788 

KC361 5 KC2725/pCB800/pCB648 

KC3973 KC2725/pCB941 

KC3977 KC2725/pCB529 

KC3981 KC2725/pCB606 

KC4510 KC2725/pCB547 

Plasmids 

pCB511 pGB2; wild-type wac^" 
pCB529 pGB2; nac-10 allele of E. coli 
pCB547 pGB2; nac-2S allele of E. coli 
pCB594 pMalC2; MBP-NACe fusion protein 
pCB606 pMalC2; MBP-NACk fusion protein 
pCB771 pGD103; NAQ-his^ 
pCB774 pGD103; wild-type NAC^ 
pCB778 pGD103; wild-type NACk 
pCB781 pGD103; NACK-his^ 
pCB782 pGD103; NACk- 

(DFGRSGHTDSL) 
pCB788 pGD103; NACKlOO-hiSg 
pCBSOO pGD103; NACKl20-his6 
pCB941 pGD103; NACElOO-his^ 
pGB2 Low-copy cloning vector; insert 

expressed from its own promoter 
pGD103 Low-copy vector expressing inserted 

material from lacP 
pMalC2 MBP fusion vector expressing 

inserted material from tac 

promoter 

" Vector; cloned insert or product. 
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between NAC^ and NAC^ occurs in the C-terminal two-thirds 
of the protein, consistent with the lack of a need to conserve a 
coeffector-binding site in the C-terminal domain. This raised 
the question of whether the C-terminal domain has any role at 
all, leading us to seek the smallest N-terminal fragment that 
would retain the biological activities associated with NAC. 

Two nac mutations also suggested that a C-terminal portion 
of NAC might be dispensable, further encouraging us to search 
for a small active fragment of NAC. The nac-IlZ allele of K 
aerogenes is an insertion of M\i6lac,amp at an unknown site 
within nacj^^ which retains considerable NAC activity (12). The 
nac-10 allele of E. coliy an insertion of a stop codon and a drug 
resistance cartridge in the center of the nactr gene, was con- 
structed as an intermediate in constructing the nac-28 null 
allele (14). Strains carrying nac-10 also retained considerable 
NAC activity. Thus, we attempted to determine whether the 
C-terminal domain of NAC from E. coli and K. aerogenes plays 
an essential role and to define the smallest N-terminal frag- 
ment of NAC that would retain the ability to function as a 
transcriptional regulator. 

MATERIALS AND METHODS 

Strains and plasmids. All of the K aerogenes strains used in this study were 
derived from WTO and are listed in Table I. 



Generation of C-terminal deletions of NAC. Unidirectional exonucIea.se Hi 
deletions were performed to make 3' deletions in the E. coli nac gene by using 
the protocol described by Henikoif (9) with minor modifications. Plasmid 
pCB574 contains the E. coti nac gene inserted into the EcoKl site of plasmid 
pGB2 (4) such that the 3' end of the gene is proximal to the Pst\ site present in 
the multiple cloning site. Deletions were initiated by cutting with Sail and Pst\ to 
allow unidirectional deletions into nac. After digestion with exonuclease HI and 
mung bean nuclease, the deleted DNA fragments were resuspended in 20 nl of 
ligation buffer (250 mM Tris-HCI [pH 7.61, 50 mM MgCU. 1 mM dithiothreitol, 
0.5 mM ATP) and 100 pmol of a dephosphorylated Nhc\ linker (containing stop 
codons in three reading frames) to which T4 DNA ligase was added. The ligation 
mixture was used to transform E. coli DHSa to streptomycin (50 fxg/ml) and 
spectinomycin (100 M-g/ml) resistance. Deletions of interest were sequenced by 
using the pUC reverse primer to determine the precise endpoint of each dele- 
tion. These were brought under the control of the lac promoter by PGR ampli- 
fication by using the primers CGAATTCAAACTGGAGACTCATATGAAC 
(forward) and AGGATCCTCACACAGGAAACAGCTATGAC (reverse) and 
insertion into the £co RI and BamHX sites of pGD103 (5). Specific truncations in 
the K. aerogenes nac gene were generated bv PGR using a forward primer 
(CTGGAATTCCTTACAGGAGGCA) containing an EcoRI site and reverse 
primers complementary to the last 18 bases prior to the desired truncation point 
followed by an amber stop codon and GGATCC {BamHV). PGR products were 
introduced into the EcoKl and BamH\ sites of plasmid pGD103, and the se- 
quences were verified by dideoxy sequencing. 

Purification of MBP-NAC fusions. The nac genes from E. coli and K aero- 
genes, along with the G-terminal truncated derivatives of each, were inserted into 
vector pMalC2 (New England Biolabs) as a blunt-ended 5' end of the nac gene 
beginning at the initiating methionine ligated to the blunt-end Xmn\ site within 
the vector. The 3' ends were ligated into either the £:coRI or BamHl sites within 
the multiple cloning site. The initial construct was prepared by PGR and se- 
quenced to verify that errors had not been introduced by PGR amplification. 
Later derivatives were made by replacing sequences within this clone by using 
internal sites (Xmnly Sspl, and BgtU) in the nac gene rather than reamplifying for 
each construct. Insertion of a blunt>ended DNA fragment into the ATmnl site of 
the vector results in fusion of the NAC protein to the 42-kDa maltose-binding 
protein (MBP). The junction between the two proteins contains the recognition 
site for factor Xa protease (He Glu Gly Arg) positioned such that cleavage with 
this enzj'me would result in separation of the intact NAG protein from the fusion. 

Purification of the fusion protein was initiated by inducing mid-log-phase 
cultures (3 x 10^ GFU/ml) with 0.5 mM isopropyl-p-cnthiogalactopyranoside 
(IPTG). The induced cultures (500 ml in 2-liter flasks) were shifted from 37 to 
28*G and grown for an additional 2 h. Cells were chilled on ice and harvested by 
centrifugation. The pellets were washed twice in column buffer (100 mM Tris- 
HQ [pH 7.5], 250 mM NaO, 1 mM dithiothreitol, 1 mM 2-mercaptoethanol) and 
resuspended in a final volume of 15 ml. Cell disruption was accomplished by 
passage through a French pressure cell (twice at 12,000 Ib/in^). The resulting 
lysate was clarified by centrifugation for 30 min at 30,000 X ^ at 4*C. The 
supernatant solution containing the fusion protein was increased in volume to 30 
ml with column buffer containing 50 mM freshly prepared phenylmethylsulfonyl 
fluoride. The crude preparation was applied to a column (2.5 cm in diameter) 
containing a 10-ml bed volume of amylose-agarose resin (New England Biolabs 
no. 800-21 L) at a flow rate of 1 ml/min. Bound fusion protein was washed with 
300 ml of column buffer at a 2-mI/min flow rate and eluted with column buffer 
supplemented with 1 M maltose at a rate of 1.5 ml/min with 3-ml fractions 
collected over 50 ml. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis 
(SDS-PAGE) was used to determine which fractions contained MBP-NAG. The 
MBP-NAG fusion generally eluted over a wide peak from fractions 3 to 7. 
Fractions were pooled if SDS-PAGE indicated sufficient purity. Protein concen- 
trations were determined by Bio-Rad protein assay and generally indicated 
values between 0.7 and 2.5 mg/ml. Diafiltration was performed by using either an 
Amicon 10- or 25-ml stir cell apparatus with PM30 filters to concentrate samples. 
Fusions were stored at -20''G following the addition of an equal volume of 
glycerol. If the fusion was to be cleaved or used immediately, it was kept on ice 
at all times. Diafiltration was also used to change the buffer in which the MBP- 
NAG was suspended to buffer 4 (8), which contains 100 mM NaHiPOu (pH 7.0), 
250 mM NaQ, 2.5 mM MgClz, and 1 mM 2-mercaptoethanol. The NAG protein 
was not transcriptionally active as a fusion protein with MBP. Therefore, to 
retrieve active NAC from these fusions for in vitro transcription experiments, it 
was necessary to cleave NAC from the fusion by using factor Xa protease. This 
was accomplished by adding 1 jig of protease to 0.5 mg of the fusion protein in 
a 100-ml volume and then incubating it for 16 h at 22*0 Following cleavage, 
freshly prepared phenylmethylsulfonyl fluoride was added to 20 mM. The 
cleaved NAC fusion protein had to be used immediately because it degraded 
quickly over time. 

Purification of his«*ta^ed NAC. To overcome the problems associated with 
MBP-NAC fusions, various six-histidine (his^O-tagged constructs were built. 
NAG N-terminal constructs were made by cloning nac genes into the 
£coRI///mdIII sites of pET28 (Novagen). The N-terminal fusions added a leader 
to the NAG proteins consisting of 56 amino acids which include a stretch of six 
histidines in a row to facilitate nickel-nitrilotriacetic acid affinity purification 
(Qiagen). Constructs in which carboxy-terminal hisi^ tags were added had the 
DNA sequence (CAT)^ followed by an amber stop codon designed into the 
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primers used to amplify the nac gene. The C-terminal his^-tagged genes were 
inserted into a derK'ative of the pET28 vector in which the material between the 
Xhol and EcoRl sites had been removed. Histidine-tagged NAC proteins were 
expressed as previously described for the MBP fusions and prepared for affinity 
purification in a similar fashion, except that the pH of the buffer was raised to 7.8 
to facilitate nickel interaction. Qarified S30 French pressate was added to a 
column with an internal diameter of 1.25 cm containing a 10-ml bed volume of 
nickel-nitriloiriacetic acid resin (Qiagen). NAC was allowed to bind the column 
at a flow rate of 2 ml/min. The column was washed with 150 ml of column buffer 
supplemented with 5 mM imidazole to eliminate nonspecific binding. NAC was 
eluted with elution buffer (300 mM imidazole [pH 7.5J, 250 NaQ, 1 mM MgCli, 
1 mM 2-mercaptoethanol). Fractions of 3 ml were collected and screened by 
Bio-Rad protein assay. NAC generally eluted in a tight peak in fraction 3. Purity 
was monitored by SDS-PAGE. 

Gel filtration analysis of purified proteins. Gel filtration chromatography was 
used to determine the size and oligomeric state of purified NAC proteins. Size 
determinations were made by one of two methods: fast protein liquid chroma- 
tography (FPLC)-mediated gel filtration or gravity-fed chromatography. For 
determinations by FPLC, a Pharmacia LCC-501 FPLC apparatus was used in 
conjunction with a Superdex 75 HR 10/30 prepacked column. The column was 
equilibrated with buffer 4 (described above) and loaded with 30 jjlI (75 M-g) of 
NAC. Separation was allowed to continue at a flow rate of 1 ml/min for 40 min. 
Detection of etuting protein was done by monitoring /1 28o- Standardization of the 
column was done with gel filtration protein standards purchased from Sigma 
Chemical Co. resuspended in buffer 4. Generally, dimeric NAC (66-kDa dimer) 
eluted at a 15-ml volume at the same fraction as bovine serum albumin (66 kDa). 
Gravity flow gel filtration was performed by using an SR 25/100 column (25-mm 
internal diameter) packed with a l50-ml bed volume of Sephaciyl S-100. Stan- 
dardization of this column was done with the same buffers, globular weight 
standards, and concentrations as the FPLC, but the loading volume was in- 
creased to i ml. The flow rate was adjusted to 0.5 ml/min, and 150 ml was allowed 
to flow. Generally, dimeric NAC (66 kDa) eluted at a volume of 16 ml from this 
column. 

In vitro transcription assays, in vitro transcriptions were carried out essen- 
tially as described previously (8). The template used was plasmid pCB695, which 
carries the K. aerogenes hutUH promoter inserted into vector pTE103, similar to 
pAM1202, which was described previously (8). Transcription mixtures contained 
0,16 pmol of supercoiled plasmid as the template and 2 to 6 pmol of RNA 
polymerase in a total volume of 25 

Enzyme assays. Histidase and glutamate dehydrogenase (GDH) activities 
were measured as described previously (12). Specific activities are reported as 
nanomoles of urocanate formed (histidase) or NADPH2 oxidized (GDH) per 
minute per milligram of cell protein. Cell protein was determined by the method 
of Lowry et al. (1 1). 

Gel mobility shift assays. Gel mobility shift assays were performed by using 
purified NACn or NACk or fragments thereof. The ability of NAC to bind DNA 
was determined by using the 330-bp £coRl-to-Wmdin fragment of the wreD 
promoter, which contains a strong NAC-binding site. Binding reaction mixtures 
consisted of 1 jjlI of DNA (0.05 pmol), 1 fil of poly(d(I • C) (50 ng/ml), 4 \l\ of 
deionized distilled HjO, and 1 p.! of a NAC dilution (0.35 to 1.7 pmol) in 50 mM 
NaHsPO* (pH 7)-l25 mM NaCI-0.5 mM MgClj-Cl mM p-mercaptoethanol- 
50% glycerol, 1-mg/ml bovine serum albumin. NAC was incubated with the DNA 
for 30 min before addition of 1.5 p.1 of loading buffer (40 mM Tris [pH 8.4], 4 mM 
EDTA, 02% bromthymol blue, 0.2% xylene cyanol, 15% Ficoll) and applied to 
a 2% Tris-borate-EDTA agarose gel. Electrophoresis was carried out at room 
temperature at 15 V/cm for approximately 1 h. Gels were stained with ethidium 
bromide, and the fragments were visualized by UV fluorescence. 



RESULTS 

Residual NAC activity in nac-10 mutants. The nac- 10 allele 
of the coli nac gene has a stop codon and a kanamycin 
resistance gene inserted into the middle of nac^r after codon 
165. The resulting NAC polypeptide has four amino acids 
encoded by the polylinker fused to the 165th amino acid of 
NAC; the C-terminal 140 amino acids of NAC are not present. 
The nac gene of E. coli was replaced with nac-10 as described 
elsewhere (14), and the replacement was confirmed by South- 
ern blotting (data not shown). To test for NAC activity, we 
measured the ability of a strain carrying nac-10 to activate the 
NAC-dependent expression of the hut operon in response to 
nitrogen-limited growth. The nac-lO mutant showed about half 
as much histidase as the wild type under NAC-dependent 
conditions (data not shown), suggesting that the NAC protein 
encoded by nac-10 retained considerable activity. However, 
NAC-dependent activation of histidase expression in E. coli is 



TABLE 2. Residual activity of nac-10 mutation" 



Strain 


Chromosomal 
allele 


Plasmid 
allele 


Presence 
of N in 
medium 


Activity* of: 
Histidase GDH 


KC2668 




None 


+ 


28 


410 










389 


65 


KC2725 


nac-203 


None 


+ 


31 


413 










54 


436 


KC2972 


nac-203 


nac'*' 


+ 


31 


292 










460 


41 


KC3977 


nac-203 


nac-10 


+ 


35 


470 










265 


141 


KC4510 


nac-203 


nac-28 


+ 


33 


397 










59 


425 



* Strains were grown to mid-log phase in glucose minimal medium with 0.2% 
(wtA-ol) glutamine as the sole nitrogen source (-) or with a combination of 0.2% 
glutamine and 0.2% ammonium sulfate as the nitrogen source (+) and assayed 
for histidase and GDH (12). 

^ Specific activity is reported as nanomoles per minute per milligram of cell 
protein. 



weak, making it dangerous to draw strong conclusions based 
on small differences. 

NAC-dependent activation of histidase expression is stron- 
ger in K. aerogenes^ and repression of glutamate dehydrogenase 
formation by NAC serves as another measure of NAC activity. 
Therefore, we studied the effect of the nac-10 allele in K. 
aerogenes. The nac-10 allele was cloned in low-copy expression 
vector pGD103 and tested for the ability to complement a 
nac-203 mutation in K. aerogenes. The wild-type nac gene from 
E. coli was able to complement nac^203 fially, showing 15-fold 
activation of histidase formation and 7-fold repression of GDH 
expression in response to nitrogen starvation. This is similar to 
the effect seen in wild-type (nac'^) K aerogenes (Table 2, cf. 
KC2668 and KC2972). In contrast, a known E. coli null allele, 
nacj;-28, failed to complement naCf^203 with strain KC4510 
{nac-28lnac-203)y showing no more activation of histidase or 
repression of GDH expression than the nac-203 mutant, 
KC2725. Strain KC3977 (nac-lO/nac-203) showed about sev- 
enfold activation of histidase formation and threefold repres- 
sion of GDH formation in response to nitrogen starvation. 
Thus, the nac-10 mutation, with an insertion of a stop codon 
and a kanamycin resistance cassette after amino acid 165, 
retained about half as much ability to activate and repress 
transcription as wild-type nac'*' from E. coli, despite the ab- 
sence of the entire C-terminal half of NAC. 

This residual activity is particularly surprising compared to 
the nac 203 mutation of K. aerogenes. nac-203 is an insertion 
of Tn5 (or Tn5-131) after the 199th codon of nac and has one 
of the tightest Nac~ phenotypes of all of the characterized K 
aerogenes nac mutants (12). Therefore, we began a systematic 
search for the smallest NAC fragment with biological activity. 

The N-terminal 100 amino acids of NAC are sufficient to 
activate and repress transcription. A collection of mutants 
with deletions of portions of the C-terminal part of the K 
aerogenes and E. coli nac genes {nac^ and nac^, respectively) 
was generated by exonuclease III digestion as described in 
Materials and Methods. These truncated nac genes were 
cloned into pGD103, where nac expression was under the 
control of the lacZ promoter. K aerogenes nac-203 mutant 
KC2725 was transformed with the resulting plasmids, and his- 
tidase and glutamate dehydrogenase activities were deter- 
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H NACg 
■ NACk 



Length 




□ NACe 
■ NACk 



Length 

FIG. 1. hut activation (A) and gdh repression (B) by N-terminal fragments of 
NAC. K. aerogenes KC2725 {nac-203:'Tn5-132) was transformed with derivatives 
of pGD103 containing an insert that coded for either full-length NAC or trun- 
cated versions of NAC. These transformants were assayed for NAC-dependent 
activation of histidase formation and repression of GDH formation. The values 
on the X axis represent the number of amino acids remaining in the NAC 
fragment. Thus, 305 represents the wild type and 100 represents the N-terminal 
ICQ amino acids. Histidase and GDH values are reported as specific activities. 



mined as a measure of NAC-dependent activation and repres- 
sion. For convenience, the truncated proteins will be named by 
the number of the last remaining amino acid. Thus, wild-type 
E. coil NAC is NACe305 and a K. aerogenes NAC lacking the 
last five amino acids is NACk300. 

Most of the truncated genes that retained 100 or more 
codons at the amino terminus were able to complement the 
nac^203 allele for activation of histidase formation and re- 
pression of GDH formation (Fig. 1). The magnitude of the 
effect varied from about 50 to 90% as much activation or 
repression as the wild type, and the mutants retaining only 100 
amino acids of NAC (NACkIOO and NACeIOO) were among 
the most active of the truncations in vivo. This result was as 
true of the £. coU nac gene as of the aerogenes nac gene and 
appears to be a general property of NAC rather than a species- 
specific anomaly. 

The single exception in this set was NACk300, which lacks 
only the five C-terminal amino acids of NAC^. and was totally 
inactive in complementation. Western blot assays of extracts 
prepared from cells carrying wild-type NAC^ or NACk300 
were carried out by using anti-NAC serum. The wild type was 
readily seen, but NACk300 was not (data not shown), suggest- 
ing that the NACk300 product is unstable and degraded rap- 
idly. 

Although NACkIOO has nearly full activity in vivo, NACk98 
complements nacj^lOS less well, and truncations with 77, 95, or 
97 amino acids remaining failed to complement nac^203. We 
have not determined whether the failure of these very short 
NAC truncation products resulted from rapid degradation or 
an inactive protein. However, it is clear that a mutant nac gene 
that encodes only the first 100 of the 305 amino acids of NAC 



is functional in activation and repression of transcription, at 
least in vivo. 

Activity of N-terminal fragments of NAC purified using 
MBP tags. Fragments of NAC containing only the N-terminal 
100, 120, or 129 amino acids were nearly as active as wild-type 
NAC in vivo. In order to prove that these fragments were 
directly responsible for the activity, we attempted to demon- 
strate activation of transcription in vitro by using purified com- 
ponents. To purify these fragments, we fused the MBP to the 
N terminus via a peptide linker that could be cleaved with 
factor Xa protease, leaving an N-terminal methionine, as is 
seen in wild-type NAC. MBP-tagged variants of wild-type 
NAC (NACk305 and NACe305) and two N-terminal frag- 
ments (NACk129 and NACeIGO) were constructed. All of 
these fusion proteins were inactive in vivo. 

NAC fragments released from MBP-NAC fusions were un- 
stable in the presence of factor Xa (or thrombin when a throm- 
bin-sensitive linker was used); therefore, the factor Xa-medi- 
ated cleavage products of MBP-NAC were added immediately 
to an in vitro transcription reaction mixture without further 
purification. As expected, cleaved MBP-NACk305 was able to 
activate hut transcription (Fig. 2A). Cleaved MBP-NACe305 
was also able to activate hut transcription, consistent with the 
observation that the nac^ gene was able to complement a nacj^ 
mutant in vivo. This also provided the first direct evidence that 
NACe active in vitro as well as in vivo, since we had been 
unable purify NAC^ because of its insolubility under all of the 
conditions tested. Cleaved MBP-NACk129 and . MBP- 
NAC^lOO were also able to activate hut transcription almost as 
well as cleaved MBP-NACk305 (Fig. 2B). Thus, NACk129 and 
NACeIOO were able to activate hut operon expression in vitro 
as well as in vivo. 

The instability of the NAC fragments in the presence of 
factor Xa precluded any attempt to quantify NAC in these 
reaction mixtures, as well as any attempt to isolate large quan- 
tities of NAC fragments. Thus, we attempted to characterize 
NAC fragments with a nickel-binding his^ tag at the C termi- 
nus. 

Activity of NAC fragments with hiSg tags. Full-length 
NACe305 and NACk305 with the nickel-binding hiSf, tag at the 
C terminus were active in vivo. In fact, the activation of histi- 
dase and repression of GDH formation were even stronger 
with NAC305-his6 that with the corresponding unmodified 
NAC305 (Table 3). This increased activation and repression 
appears not to be specific to the nature of the C-terminal tag, 
since a cloning artifact that fused an 11-amino-acid peptide 
(DFGRSGHTDSL) to the C terminus of NACk305 also re- 
sulted in more activation and repression than NACk305 (Table 
3). Although the modified E. coll NAC (NACE305-his6) was 
active in vivo, the modification did not prove useful as a means 
of purifying NACg, since NACE305-his6 was insoluble in all of 
the buffers tested, just like NACe305. 

Three N-terminal fragments with the his5 tag at the C ter- 
minus were constructed as described in Materials and Meth- 
ods: NACEl00-his6, NACKlOO-hiSfi, and NACKl20-his6. In 
contrast to their unmodified counterparts, both NACKlOO-his^ 
and NACElOO-hiSg were inactive in vivo (Table 3). Thus, we 
focused our attention on NACKl20-his6. NACK:120-his6 was 
purified by Ni affinity chromatography and shown to be able to 
activate hut operon transcription by purified RNA polymerase 
(data not shown). Unfortunately, we have had difficulty puri- 
fying NACKl20-his5 from strains carrying either high-copy or 
low-copy expression vectors on a reliable basis. We have no 
explanation for this difficulty, although it may represent either 
instability of the polypeptide or toxicity of the fragment. 

In a final attempt to isolate an active form of NACg for 
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TABLE 3. Activation of histidase formation and repression of 
GDH formation in vivo by modified forms of NAC 



^hutUH 
transcript 



B 

1 



^ MBP-NAC100 £ MBP-NAC129 MBP-NAC305 
h^^^^ cut fc^m^^ cuttoto|^ ^m^Wjjm 

23456789 




^hutUH 
transcript 



FIG. 2. Activation of hulUH transcription in vitro by NAC and N-terminal 
fragments of NAC. In vitro transcription was carried out by using purified 
components as described in Materials and Methods. The supercoiled template, 
pCB695, was similar to pAM1202 (8) and contained the hutUH promoter region 
cloned upstream of a strong transcriptional terminator such that transcription 
from this promoter would yield a transcript with the mobility on SDS-PAGE 
indicated at the right. All reactions contained RNA polymerase, nucleotides 
(including radioactive UTP), and buffers. NAC and NAC fragments were added 
as indicated. (A) Addition of full-length NAC. Lanes: 1, no NAC; 2 and 3, 0.04 
and 0.4 ^.g of NAC (0.06 and 0,6 pmol of dimers) purified from K. aerogenes by 
salt precipitation (8); 4 and 5, 0.1 and 1 jjig of MBP>NACk (1-3 and 13.3 pmol of 
MBP-NACk monomers) treated with factor Xa for 30 min immediately before 
addition to the transcription mixture; 6 and 7, 0.1 and 1 \l% of MBP-NACq (1.3 
and 13.3 pmol of monomers) treated with factor Xa for 30 min immediately 
before addition to the transcription mixture; 8, 3 fxg of MBP-NAC,t (40 pmol of 
monomers) treated with factor Xa for 16 h before addition to the transcription 
mixture. (B) Addition of NAC fragments. Lanes: 1, no NAC; 2 and 3, 0.07 and 
0.7 jjLg of NAC (1.1 and 10.5 pmol of dimers) purified from K. aerogenes by salt 
precipitation; 4 and 5. 0.1 and 1 M.g of MBP-NACeIOO (1.9 and 19 pmol of 
monomers) treated with factor Xa for 30 min immediately before addition to the 
transcription mixture; 6 and 7, 0.13 and 1.3 jxg of MBP-NACk129 (1.9 and 19 
pmol of monomers) treated with factor Xa for 30 min immediately before 
addition to the transcription mixture; 8 and 9, 0.14 and 1.4 i^gof MBP-NACk305 
(1.9 and 19 pmol of monomers) treated with factor Xa for 30 rain immediately 
before addition to the tran.scription mixture. 



comparison to NAC^ in vitro, we constructed a gene encoding 
NACe ^h^t was modified at the N terminus with MBP and at 
the C terminus with his^ (MBP-NACE-hiSg). We hoped to use 
the hiSft tag to separate the NACe fragment from factor Xa 



NAC source 
and strain" 


Flasmid^ 


NAC product*" 


Sp act^ of: 
Histidase GDH 


E. coU 










KC2725 


None 


None 


31 


413 


KC3568 


pCB774 


NACt 


415 


46 


KC3210 


pCB594 


MBP-NACe 


34 


435 


KC3555 


pCB771 


NACe-hiSe 


890 


21 


KC3973 


pCB941 


NACelOO-his^, 


42 


354 


K, aerogenes 










KC3572 


pCB778 


NACk 


520 


59 


KC3981 


pCB606 


MBP-NACk 


37 


417 


KC3578 


pCB781 


NACK-hisg 


650 


35 


KC3580 


pCB782 


NACk-DFGRSGHDTSL 


720 


32 


KC3615 


pCBSOO 


NACKl20-his5 


295 


ND* 


KC3591 


pCB788 


NACKlOO-hiSfi 


45 


335 



° All strains are derived from K. aerogenes KC2725 {nac'203/r^n5-231), grown 
in glucose minimal medium to mid-log phase, and assayed as previously de- 
scribed (12). IPTG (0.5 mM) was included for strain KC3981 to guarantee 
expression from the pMalC2 vector. 

^ All plasmids except pCB594 and PCB606 were derived from pGD103 as 
described in Materials and Methods. Plasmids pCBS94 and pCB606 were derived 
from pMalC2. 

The code is described in detail in the text. Subscripts indicate derivation from 
E. coli (E) or /C aerogenes (K), MBP indicates MBP fused to the N terminus of 
NAC. hiSft indicates six histidine residues added to the C terminus. DFGRS- 
GHTDSL indicates fusion of 11 amino acids to the C terminus of NAC (amino 
acids are indicated by the one-letter code). NAClOO and NAC 120 indicate that 
onlv the first 100 or 120 amino acids of NAC were present. 

''Specific activity is reported as nanomoles per minute per milligram of cell 
protein. 

" ND, not determined. 



after cleavage of MBP-NACe-hiSe. However, MBP-NACe-hiSf, 
was completely resistant to cleavage by factor Xa, in marked 
contrast to MBP-NACe- O^^Y the addition of strong denatur- 
ants allowed MBP to be removed, and this was accompanied by 
very rapid disappearance of NACe- No active NAC^ was re- 
covered after attempts to renature the material. Apparently, 
the C-terminal his^ tag had altered the factor Xa cleavage site 
in MBP-NAC^-hiSft so as to make it inaccessible to cleavage by 
factor Xa, suggesting that the C terminus of the folded protein 
is close to the N terminus. Thus, the C-terminal hiSg tag would 
be able to prevent the protease from reaching the cleavage site 
at the N terminus of the NAC polypeptide. 

Subunit structures. We next used the tagged NAC polypep- 
tides to search for the dimerization site on NAC. These data 
(along with a summary of the in vitro activation data and 
DNA-binding data) are presented in Table 4. NAC^ and 
NACk with an N-terminal MBP tag eluted as monomers. If 
factor Xa was added, the elution profile was complex and 
included a peak of the size expected for MBP-NAC (un- 
cleaved, monomer), MBP (monomer), NAC (dimer), and an 
unknown peak with an apparent size of 8.5 kDa (presumed to 
be a degradation product of the 33-kDa NAC monomer). 

A smaller N-terminal modification was also tested by fusing 
a 56-amino-acid peptide that included a his6 tag in its midst to 
the N terminus of NAC^ and NACg. Both of these fusions 
were inactive in vivo (data not shown). The N-terminally hiSg- 
tagged NACg and NACk were purified by Ni affinity chroma- 
tography. The tagged NAC^ was found to be dimeric in solu- 
tion even before cleavage (NACg was not tested). However, 
neither the NAC^ nor the NAC^ fusion protein was able to 
activate transcription in vitro (Table 4), Cleavage of the tagged 
NACe with thrombin removed the hisg domain but left an 
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TABLE 4. In vitro properties of modified forms of NAC 



Construct 


Apparent 
molecular size in 
solution (kDa) 


Form^ 


Gel 
mobility 
shift^ 


activation'^ 


NACe 




I 






MBP-NACk 


75 (66) 


M(D) 


Y(Y) 


(Y) 


56his6-NACE 






Y(Y) 


N(N) 


NACtr-hiSft 




I 


NACelOO-hiSfi 




I 






MBP-NAChIOO 








(Y) 


NACk 


66 


D 


Y 


Y 


MBP-NACk 


70 (66) 


M(D) 


Y(Y) 


N(Y) 


56his6-NACK 


77 


D 


Y 


NACK-hiSft 




Y 


Y 




MBP-NACk129 


56 (28) 


M(D) 




(Y) 


NACKl20-his6 


27 


D 


Y 


Y 


NACKlOO-hiSf, 


22 


D 


Y 





" NACjE, NACk, '^^^ several modified forms thereof were purified as described 
in Materials and Methods. The apparent molecular size in solution was deter- 
mined by gel filtration, and this size was used to infer whether the product was 
monomeric or dimeric in solution. The ability to bind DNA was determined in a 
gel mobility shift assay using the urease operon promoter {ureDp) as a target. 
The ability to activate transcription from the hiuUH promoter {htitUp) was tested 
as described in the legend to Fig. 2. The values in parentheses were determined 
after cleavage with Factor Xa or thrombin. 
M, monomer; I, insoluble; D, dimer. 
Y, yes; N, no. 



18-amino-acid peptide fused to the N terminus of NACe- This 
cleavage product was also unable to activate transcription in 
vitro (Table 4). In summary, fusions to the N terminus of 
NACe increase its solubihty, but even small additions at the N 
terminus block its ability to activate transcription. Large addi- 
tions (MB?) block the ability to dimerize in solution (Table 4). 

Somewhat surprisingly, the C-terminally his^-tagged NAC 
fragments of NAC^ (NACjclOO-hiSe and NACKl20-his6), as 
well as the product released from MBP-NACk129 by (factor 
Xa) cleavage, were all dimers in solution as measured by gel 
filtration. Thus, the N- terminal domain of 100 amino acids also 
appears to contain the signals required for dimer formation, as 
well as for activation of transcription. 

DISCUSSION 

Perhaps the most surprising finding in these experiments is 
that the N-terminal 100 amino acids of NAC contain the de- 
terminants for all of the known functions of NAC. NAClOO 
was able to activate histidase formation in vivo and hutUp 
transcription in vitro. NAClOO was able to repress GDH for- 
mation in vivo. NACj^lOO-hiSft was able to bind a DNA frag- 
ment with a NAC-binding site, and NACKlOO-his5 was present 
as a dimer in solution. In short, NAC^lOO (or NAC^lOO-hiSe) 
displayed all four known properties of NAC: activation of 
transcription, repression of transcription, site-specific DNA 
binding, and dimerization. 

All of the truncations of NAC isolated in these experiments, 
except NACk300, showed considerable NAC activity. How- 
ever, several tight nac mutants have been isolated before by 
using selection and screening. One of these, nac-203\:Tx\5 , has 
been sequenced and is an insertion of Tn5 after the 199th 
codon of naCf^. The data in Fig. 1 suggest that this mutation 
should leave NAC partially active, but nac-203 is one of the 
tightest nac mutants in our collection (although even nac-203 
retains some activity, particularly on the ure operon promoter). 
Thus, it seems likely that NACk300 and NACk199 are inactive 



because of instability or misfolding rather than because of any 
lack of informative sequence. 

The data presented here also provide some information 
about the function of the N and C termini of NAC. Additions 
to the N terminus block the ability of NAC to activate tran- 
scription, and small additions to the C terminus increase this 
activity, at least in vivo. The effect of the N-terminal additions 
appears to be intrinsic to the protein, since it is also seen in 
vitro with purified protein. Moreover, the ability to activate 
transcription can be recovered (in the case of MBP-NAC) by 
removing the N-terminal addition. The effect of the C-terminal 
additions may likewise be intrinsic to the protein, but it may 
also reflect an increased stability of the C-terminally modified 
NAC, We have been unable to raise a high-titer antibody with 
sufficient specificity for NAC to address this question directly, 
but we have previously suggested that wild-type NAC is rapidly 
turned over in vivo (14). 

Two lines of evidence led us to suspect an interaction be- 
tween the N and C termini of NAC and perhaps other LTTRs. 
First, MBP-NAC-his^ was completely resistant to cleavage by 
factor Xa, while MBP-NAC was readily cleaved. Second, other 
work in this laboratory (1) has shown that replacement of the 
amino-terminal 20 amino acids of NAC with the corresponding 
amino acids from OxyR results in an inactive NAC unless the 
C-terminal domain of this NAC-OxyR chimera is also replaced 
with the C-terminal domain from OxyR. 

The helix-turn-helix region thought to be the DNA-binding 
site is located very close to the N terminus of LTTRs. (about 
amino acids 20 to 40 for NAC). The binding site for coeffectors 
of LTTRs has generally been found in the C-terminal domain, 
between amino acids 100 and 2(H} (15). Thus, an effect of the 
C-terminal domain on the N-terminal domain must exist. The 
specificity requirement in the OxyR-NAC chimeras (1) and the 
ability of the C-terminal his^ tag to prevent cleavage of MBP 
from the N terminus of MBP-NAC-hiSg seem more consistent 
with a hypothesis of direct interaction than with transmission 
of a signal through the middle part of the protein. However, in 
the absence of experiments to test a direct interaction, such 
arguments remain speculative at best. 

It was exciting to discover that the four known regulatory 
properties of NAC are encoded within a NAC fragment con- 
taining only the N-terminal 100 amino acids, NAClOO. It was 
disappointing to discover that a fifth property of NAC, the 
insolubility of NAC^, was also encoded within this fragment. 
Most of the amino acid differences between NAC^ and NAC^ 
lie in the C-terminal domain; the N-terminal domains are more 
similar (14). Nevertheless, even NAC^lOO-his^ is much more 
insoluble than NACKlOO-his^. Thus, future studies will con- 
tinue to focus on NACk ^o*" vitro studies. 
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Long-chain L-a-hydroxy acid oxidase from rat kidney is a member of the family of FMN- 
dependent a-hydroxy-acid-oxidizing enzymes. With the knowledge of the recently determined amino 
acid sequence, the cDNA encoding the enzyme has now been cloned using the polymerase chain 
reaction. The 1648-bp cDNA contains an open reading frame coding for the 352 residues of the 
previously determined sequence, preceded by a methionine codon. In addition, several clones were 
found to present a nine-base insertion, predicting the existence of an isoform with a tripeptide VRK 
inserted between residues 188 and 189 of the mature protein. The presence of about 10% of this 
isoform in the oxidase purified from rat kidney was indeed identified by amino acid sequencing. A 
recombinant active enzyme was obtained as a protein fused to glutathione 5-transferase using the 
bacterial expression plasmid pGEX-3X. Physico-chemical characterization indicated, for the fused 
enzyme, properties similar to those of the rat kidney protein. When the chimaera was submitted to 
factor Xa, proteolysis at the engineered cleavage point was poor. Separation of hydroxy acid oxidase 
from glutathione 5-transferase could not be achieved with trypsin either. With both proteases, the 
initial cleavage point appeared to be in a peptide loop internal to the hydroxy acid oxidase sequence, 
close to or in the tripeptide insertion locus and not at the engineered factor-Xa-cleavage point. 
Comparative tryptic proteolysis of the rat kidney enzyme yielded a form cleaved in the same loop. 



Long-chain a-hydroxy acid oxidase is an FMN-depen- 
dent dehydrogenase-oxidase, first described by Blanchard et 
al. (1946). It preferentially oxidizes long-chain aliphatic and 
aromatic L-a-hydroxy acids in contrast to isozyme A, gly- 
colate oxidase, which preferentially oxidizes the shortest a- 
hydroxy acid. The rat kidney enzyme was first isolated as an 
enzyme with weak L-amino acid oxidase activity (Blanchard 
et al., 1945). More recently. Brush and Hamilton (1981) sug- 
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Abbreviations. PCR, polymerase chain reaction; HAO, long- 
chain L-2-hydroxy acid oxidase; ss, single stranded; IPTG, isopro- 
py! ^-D-thiogalactopyranoside; Cl^Ind, 2,6-dichlorophenol indo- 
phenol; GST, glutathione S-transferase ; ANC, anchor adaptor. 

Enzymes. L-amino acid oxidase [(5)-amino acid: oxygen 2-oxi- 
dorcductasc] (EC 1.4.3.2); factor Xa (EC 3.4.21.6); glutathione S- 
transferase (EC 2.5.1.18); glycolate oxidase [(5)-hydroxy acid: ox- 
ygen 2-oxidoreductase, isozyme A] (EC 1.1.3.15); L-hydroxy acid 
oxidase [(5)-hydroxy acid:2-oxygen oxidoreductase; isozyme B] 
(EC 1.1.3.15); (5)-lactate: cytochrome c 2-oxidoreductase or flavo- 
cytochrome (EC 1.1.2.3); L-lactate oxidase [(.S)-lactate: oxygen 2- 
oxidoreduciase (decarboxylating)] (EC 1.13.12.4); trypsin (EC 
3.4.21.4). 

Note. The novel nucleotide sequence data published here have 
been submitted to the EMBL, GenBank and DDBJ sequence data 
banks and are available under the accession number X67156. 



gested thiolglyoxylate adducts were its true physiological 
substrates, even though these compounds are not processed 
at a higher rate than hydroxybutyrate. This peroxisomal en- 
zyme is found together with isozyme A in hog, beef and 
sheep kidney (Robinson et al., 1962; McGroarty et al., 1974; 
Zaar and Fahimi, 1991), whereas it represents the only a- 
hydroxy-acid-oxidizing activity in rat kidney (Blanchard et 
al., 1946; Nakano and Danowski, 1966; McGroarty et al., 
1974; Angermuller et al., 1986; Zaar and Fahimi, 1991). 

The recentiy determined amino acid sequence of rat kid- 
ney long-chain hydroxy acid oxidase (HAO) (Le and Led- 
erer, 1991) established homology between this protein and 
other FMN-dependent a-hydroxy-acid-oxidizing enzymes, 
including glycolate oxidase from spinach (Volokita and Som- 
erville, 1987; Cederlund et al.. 1988), lactate oxidase from 
Mycobacterium smegmatis (Giegel et al., 1990), mandelate 
dehydrogenase from Pseudomonas putida (Tsou et al., 1990) 
and flavocytochrome bz or L-lactate dehydrogenase from ye- 
asts (Lederer et al., 1985; Guiard, 1985; Black et al., 1989; 
Risler et al., 1989). The reaction catalyzed by all these en- 
zymes occurs in two stages. In the reductive half-reaction, 
the substrate is oxidized and FMN reduced; in the oxidative 
half- reaction, the flavin is reoxidized. Mechanistic studies, 
combined with sequence comparisons and the knowledge of 
the three-dimensional structures of glycolate oxidase and fla- 
vocytochrome bi (Lindqvist, 1989; Xia and Mathews, 1990; 
Lindqvist et a]., 1991), have led to the idea that the flavin 
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reductive half-reaction has a very similar, if not identical, 
mechanism for all these enzymes (Ghisla, 1982; Urban and 
Lederer, 1985; Lederer and Mathews, 1987; Urban et al., 
1988; Lindqvist and Branden, 1989; Giegel et al., 1990; 
Lederen 1992). In contrast, the oxidative half-reaction is not 
identical; the oxidases utilize oxygen with production of hy- 
drogen peroxide, whereas the flavocytochrome i>2 flavin has 
a built-in monoelectronic acceptor, heme ^2- The physiologi- 
cal acceptor for mandelate dehydrogenase is not known. 
Clues as to the structural differences which lead to the differ- 
entiation between the dehydrogenase/oxidase and the de- 
hydrogenase/electron transferase function were suggested by 
the comparison between the three-dimensional structures of 
flavocytochrome bi and glycolate oxidase (Lindqvist et al., 
1991). 

Nowadays, one efficient way of furthering the under- 
standing of structure/function relationships is the mechanistic 
comparison between wild-type enzymes and site-directed 
mutants. Such a study is underway for flavocytochrome b2 
from Saccharomyces cerevisiae (Reid et al., 1988; Dubois et 
al., 1990; Miles et al., 1992). in view of a number of kinetic 
similarities between this enzyme and HAO (Cromartie and 
Walsh, 1975b; Pompon et al., 1980; Urban and Lederer, 
1984; Urban et al., 1988), we decided to use this same ap- 
proach with HAO in order to pinpoint the requirements for 
flavin reactivity with oxygen. As a first step in this direction, 
we report here the cloning of the cDNA encoding HAO from 
rat kidney and its expression in Escherichia coli as a cata- 
lytically active enzyme fused with glutathione 5-transferase. 



MATERIALS AND METHODS 
Materials 

The plasmid pKS M13+/-, E. coli strains XLl-blue and 
HB 101 were from Stratagene. Expression plasmids pGEX- 
3X and pKK233-3 were Horn Pharmacia. All restriction cn- 
donucleases, modifying enzymes and factor Xa were from 
New England Biolabs, while Mu-MLV reverse transcriptase, 
terminal deoxynucleotidyl transferase and Taq DNA poly- 
merase were from Bethesda Research Laboratories. To- 
sylphenylalanine-chloromethane-treated trypsin was from 
Merck. DNA sequencing was performed on double- stranded 
DNA using modified T7 DNA polymerase (sequenase) from 
U. S. Biochemicals. [a-^^P] dCTP and [a-^^S] dATP were 
purchased from Amersham Corporation. All other chemicals 
were of reagent grade. Oligodeoxynucleotides were synthe- 
sized on a model 391 Applied Biosystems DNA synthesizer. 

Isolation of mRNA and Northern-blot analysis 

Total RNA was extracted from fresh kidney tissue from 
one Wistar rat with the acid/phenol method (Chomczynski 
and Sacchi, 1987). Poly(A)-rich RNA was isolated from total 
RNA by o]igo(dT)-ceIlulose chromatography (Aviv and 
Leder, 1972). Total and poly(A)-rich RNA were subjected 
to electrophoresis on a 0.8% agarose/formaldehyde gel 
(Lehrach et al., 1977), transferred to a nitrocellulose filter, 
hybridized with the 362-bp polymerase chain reaction (PCR) 
fragment and radiolabelled by random priming (Feinberg and 
Vogelstein, 1983). Hybridization was at 65**C in Church's 
solution (Church and Gilbert, 1984). Washing was at a final 
stringency of O.IX solution A at 60°C (20X solution A is 
3 M NaCl, 0.2 M NaH^PO*, 0.02 M EDTA, pH 7.4). 



Internal-cDNA amplification 

Single-stranded (ss) cDNA was synthesized from poly- 
(A)-rich kidney RNA (5 |ig) using Mu-MLV reverse tran- 
scriptase and oligo-(dT)i5 as a primer. Amplification, using 
the 'mixed oligonucleotide primed amplification of cDNA' 
procedure (Lee et al., 1988; Girgis et al., 1988), was carried 
out in 50 |il with 100 ng ss cDNA and 200 pmol each primer 
mixture (XI and X2), After 30 cycles (94*=^C, 1 min; 37X, 
2 min; 72**C, 1.5 min), the reaction mixture was fractionated 
by electrophoresis (1 % low-melting-point agarose in 40 mM 
Tris/acetate, 1 mM EDTA, pH 7.2) and aliquots from five gel 
slices, corresponding to the 200— 400-bp region, were further 
amplified in the same conditions for 30 cycles. 

5'-end amplification 

dG-tailing of ss cDNA was performed according to Dent 
and Wu (1984). Amplification was carried out in 50 nl with 
8 ng dG-tailed ss cDNA, using gene-specific X3 (40 pmol) 
and oligo-(dC),5 (200 pmol). After 30 cycles (94 °C, 1.5 min; 
45 X, 1.5 min; 72**C, 1.5 min), the reaction product was iso- 
lated by electrophoresis (1 % low-melting-point agarose) and 
an aliquot was reamplified with gene-specific X4 and oligo- 
(dC)i5 (5 pmol each) for 30 cycles as described above. 

3'-end amplification 

5 ^ig poly(A)-rich RNA was reverse transcribed in 50 jil 
as described above, except for substitution of 45 pmol (dT)i7- 
ANC to oligo-(dT),5 primer. The reaction mixture was di- 
luted to 1 ml with solution B (10 mM Tris/HCl, 1 mM 
EDTA, pH7.5) and stored at 4°C. The cDNA pool (5 ^1) 
was amplified in 50 \i\ with gene-specific X5 and an anchor 
adaptor (ANC) as primers (25 pmol each), using 55 °C as 
annealing temperature and 1 .5 min for the extension step at 
72 °C (Frohman and al„ 1988). Two additional rounds of am- 
plification were performed with the nested X6 and X7 pri- 
mers for the 5'-end and ANC for the 3'-end. 

Cloning and sequencing of cDNA 

PCR products were extracted with phenol/chloroform and 
ethanol precipitated. The Klenow fragment and T4 polynu- 
cleotide kinase were used to fill-in ends and phosphorylate 
5'-termini respectively (Sambrook et al., 1989). 

After electrophoresis (1% low-melting-point agarose), 
the region of the gel containing specific products was iso- 
lated and the DNA was cloned in the Eco^Csf or Smal site of 
the Bluescript vector. Double-stranded DNA from recombi- 
nant plasmids was prepared using standard procedures 
(Sambrook et al., 1989) and sequenced with modified T7 
DNA polymerase (sequenase), following the dideoxy-chain- 
temfiination procedure (Sanger et al., 1977) according to ihe 
reconmiendations of the supplier. 

Attempts at direct expression of HAO cDNA 

The entire coding region of HAO was obtained by PCR 
amplification, using suitable primers, of the ss cDNA ob- 
tained from poly(A)-rich RNA. It was then cloned into the 
A^col site of the prokaryotic expression vector pKK233-3. 
The plasmid was introduced into E. coli strain JM 105. After 
induction with 1 mM isopropyl )9-D-thiogalactopyranoside 
(IPTG) under standard conditions, no band was observed at 
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the expected position on the Coomassie-blue-stained protein 
pattern of a SDS/gel electrophoresis. Changing induction 
conditions, or growth temperature, and introducing the plas- 
mid into other strains (HB 101, XLl-blue, AR 120, N5151) 
were unsuccessful. 



Construction of the expression plasmid carrying rat 
hydroxy acid oxidase cDNA fused to glutathione 
^-transferase cDNA 

Procaryotic expression vector pGEX-3X (Smith and 
Johnson, 1988) was linearized with Smal and treated with 
calf intestinal alkaline phosphatase. This vector was then lig- 
ated to the entire coding region of HAO cDN A obtained after 
PCR amplification. The recombinant plasmid (pGHAOX) 
was introduced into E, coli strain XLl-blue. 



Expression of hydroxy acid oxidase 

E. coli transformed with pGHAOX was cultured over- 
night in Luria-Bertani medium containing 50 M^g/ml ampicil- 
lin and 12.5 i^g/ml tetracycline at 37 °C. The culture was then 
diluted 1:50 into 4 1 of the same medium and incubated at 
37 °C with good aeration. When the absorbance, measiured at 
600 nm, reached a value of 0.4, IPTG was added to a final 
concentration of 1 mM and incubation was continued for 3 h. 
Aliquots of cells (1 ml) were harvested before and after in- 
duction, pelleted, resuspended in 100 ^1 SDS-loading buffer, 
then subjected to 0.1 % SDS/10% PAGE (Laenmili, 1970). 
Protein bands were visualized by Coomassie-blue staining. 

Purification of hydroxy acid oxidase 

Rat kidney enzyme was piuified as described by Urban 
et al. (1988). For obtaining the recombinant enzyme, frozen 
cells from a 4-1 culture, grown as described above, were 
thawed and lysed by sonication in 40 ml NaCl/Pj-Triton 
(150 mM NaCl, 20 mM sodium phosphate, \% Triton X- 
100, pH 7.3). After centrifugation, the supernatant was 
loaded on a glutathione— Sepharose-4B column (4 ml total 
volume) under conditions recommended by the manufac- 
turer. After elution with 5 mM glutathione in 50 mM Tris/ 
HCl, pH 8, the fusion protein (300 nmol) was precipitated 
by addition of ammonium sulphate to 70% saturation; the 
precipitate was dissolved in a minimal volume of 50 mM 
Tris/HCl, 10 pM FMN, pH 8.0, dialyzed against the same 
buffer and stored at -80**C. 



Molecular-sieve chromatography 

An AcA34 column (1.6 cm X 80 cm) was equilibrated in 
1(X) mM sodium/potassium phosphate buffer, 1 mM EDTA, 
pH7, at a flow rate of 15 ml/h. A mixtm-e of 1.2 mg 
flavocytochrome £>2 [M, 230000 (Lederer et al. 1985)], 
2.7 mg glutathione-5-transferase— HAO chimaera (GST, glu- 
tathione 5-transferase) and 0.9 mg rat kidney hydroxy acid 
oxidase [M^ 156000 (Le and Lederer, 1991)] was chromato- 
graphed on the column at the same flow rate. The efHuent 
was monitored at 280 tun. Protein-containing fractions were 
assayed for mandelate/Clz-Ind oxidoreductase activity; 
(HAO activity; Cljind, 2,6-dichlorophenoI indophenol) and 
lactate-ferricyanide oxidoreductase activity (flavocytoch- 
rome ^2)- HAO can oxidise lactate but its specific activity 
with this substrate is more than 250*fold lower than that of 



flavocytochrome the latter cannot oxidize mandelate 
(Lederer, 1991). 

Enzymic assays 

The standard activity assay solution consisted of 25 mM 
D,L-2-hydroxybutyrate, 70 ^iM CU-Ind (0.002%) in 0.1 M 
Tris/HCl. pH 8.4, and 1 mM EDTA. Dye reduction was 
monitored with a Uvikon 930 spectrophotometer at 30°C. 
An absorption coefficient of 22000 M~* cm"' was used for 
C]2lnd at 600 nm. The enzyme activity unit is expressed 
in nmol dye reduced • s^^ - mol enzyme"*. Protein concen- 
trations were determined by the Bio-Rad protein microassay 
procedure using bovine serum albumin as a standard. The 
enzyme concentration was expressed relative to its FMN 
content (£450= 11700 M"' cm"^). 

Proteolysis with trypsin and factor Xa 

Proteolysis with factor Xa was performed in the buffer 
described by Nagai and Th0gersen (1984), at a protease/sub- 
strate ratio of 1 : 100. Proteolysis with trypsin was performed 
in 50 mM Tris, pH 8, at a protease/substrate ratio of 1 : 5(X) 
at 0°C. At each given time, aliquots were removed and 
quenched with 1 voL 60 ^M tosyllysyl chloromethane in 
100 mM imidazole/HCl, pH 7.5. 

Peptide sequencing 

SDS/PAGE was carried out m Tris/borate (45 mM Tris, 
44 mM boric acid, 2 mM EDTA. 0.1 % SDS, pH 7). Electro- 
transfer was carried out in the buffer in the absence of SDS, 
with the semidry apparatus from Millipore (Miniblot-SDE) 
at 0.2 mA/cm^ for 1 h. Automated Edman degradation was 
carried out on the excised spots with a gas/liquid-phase se- 
quencer model 477A from Applied Biosystems equipped 
with on-line phenylthiohydantoin amino acid analysis. 



RESULTS 

Isolation of the cDNA of rat Iddney hydroxy acid oxidase 

Knowledge of the amino acid sequence of the enzyme 
(Le and Lederer, 1991) was used for synthesizing two sets 
of degenerate oligonucleotides, XI and X2 (Table 51), as 
primers for amplifying a 362-bp internal fragment of ss 
cDNA from rat kidney, using PCR (Figs SI and S2). After 
clotiing and sequencing, the amplified DNA showed an open 
reading frame corresponding to the expected amino acid se- 
quence. Non-degenerate oligonucleotides, corresponding to 
various parts of this internal sequence, were then chosen to 
obtain the full-length cDNA sequence following the an- 
chored PCR protocol (Loh et al., 1989; Table SI and Figs SI 
and S2). For 5'-end amplification, X3 and (dC),5 were used 
after dG-tailing of the ss cDNA. The product was fiuther 
amplified using (dC)t5 and the nested primer X4 in order to 
increase specificity. The resulting 370-bp fragment was 
cloned and sequenced (three independent clones). For the 3'- 
end amplification, poly(A)-rich mRNA was first reverse tran- 
scribed using as a primer (dT),7-ANC (Table SI), according 
to Frohman et al. (1988) (the ANC adaptor contains the Xhol^ 
Sail and Clal restriction sites). The ss cDNA was then sub- 
mitted to three rounds of cunplification using the nested oli- 
gonucleotides X5, X6 and X7 successively for priming the 
5'-side and ANC for the 3'-side. Two fragments of about 1.1 
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T^ble 1. Comparison of kinetic parameters of the gIutatiiione-5-transferase-iiydroxy-acid-oxidase chimaera and of rat kidney 
hydroxy acid oxidase. The kinetic determinadohs were carried out in 0.1 M imidazole/HCl, pH 7^ at 30 ^C. n.d., not determined. 
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HAO (Urban et al.. 1988) 


1.0 ±0,1 


1.02 




0.02 


1.0 


0.40 ± 0.02 


1.52 ± 


0.08 


3.8 



kb were obtmned, subcloned and sequenced, using the strat- 
egy shown in Fig. S3. 

Fig. S2 shows the nucleotide sequence deduced for the 
cDNA of the HAO gene by merging the information obtained 
from sequencing the three PCR fragments; the 5'-end, the 
central part and the lower- molecular- mass 3'-end. The se- 
quence contains a 5'-untranslated region of 202 bp, an open 
reading frame of 1059 bp and a 3'-un translated region of 
387 bp [excluding the poly(A)-rich tail]. The coding region 
corresponds to the expected HAO amino acid sequence, with 
an additional methionine before Prol of the mature protein. 
As no other ATG codon is found further upstream in the 
correct reading frame, this must be the initiator codon, even 
though, of the consensus sequence for the initiation of pro- 
tein synthesis in eukaryotes, only G at position — 3 and C at 
position —5 are present (Kozak, 1984, 1986). The methio- 
nine must be removed by proteolysis during biosynthesis. 
No N-terminal signal sequence is expected for HAO, which 
possesses the C-terminal peroxisomal targeting sequence 
SRL (Gould et al., 1990). 

The higher-molecular-mass PCR fragment of about 1.1 kb 
corresponding to the 3'-end had, after position 568 for the 
three clones that were examined, a nine-base insertion corre- 
sponding to TGAGAAAGG. A nine-base 3 '-terminal exten- 
sion (AAAAATGAA) was also present. The nine-base inser- 
tion corresponds to a 3 amino acid insertion (VRK) between 
El 88 and El 89 of the mature protein. The presence of these 
residues had not been detected in the amino acid sequence 
work (Le and Lederer, 1991). 

Tissue distribution of hydroxy acid oxidase mRNA 

Northern-blot analysis of rat kidney poly(A)-rich RNA, 
using, as a probe for hybridization under stringent conditions, 
the 362-bp internal PCR fragment obtained with XI and X2, 
revealed the presence of two strong bands of «1.8 kb and 
2.0 kb (Fig. lA). These bands were also easily visualized in 
total kidney RNA. No signal was observed in total RNA 
samples from other rat tissues (lung, heart, brain, muscle, 
liver and testis; Fig. IB). This confirmed the tissue speci- 
ficity of the kidney enzyme (Tolbert, 1981). The length of 
the cloned cDNA (1648 bp) suggested it corresponded to the 
lower-molecular-mass mRNA. 

Expression of hydroxy acid oxidase in E. coli 

as a fusion protein with glutathione ^S-transferase 

The full-length coding region, starting at the Prol codon, 
was amplified by PCR from total kidney ss cDNA using 
primers corresponding to the 5'-end and 3'-end of the coding 
sequence (20 nucleotides). Its sequence was checked and 
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Fig. 1. Size and tissue distribution of hydroxy acid oxidase 
mRNA. 5 Jig poly(A)-rich RNA from rat kidney (A) and 10 ng total 
RNA from difTerent rat tissues (B) were analysed by Northem-blot 
analysis using, as a probe, the 362-bp fragment obtained with XI 
and X2 as primers. In (B), RNA was from lung (1), heart (2), brain 
(3), muscle (4), liver (5), kidney (6) and testis (7). The position of 
the 1 8S and 28S ribosomal RNA is indicated. The arrows highlight 
the two bands observed for hydroxy acid oxidase mRNA. 



found to correspond to that shown in Fig. S2, i.e. it did not 
present the VRK insertion. Initial attempts to express HAO 
as an unfiised protein using the pKK233— 3 bacterial ex- 
pression vector were unsuccessful, despite a correct sequence 
and the use of five different E. coli strains. As an alternative, 
the HAO coding region was subcloned into the prokaryotic 
expression vector pGEX-3X. The resulting construct is ex- 
pected to yield a fusion protein with GST from Schistozoma 
japonicum. Moreover, the chimaeric protein has been de- 
signed to contain a specific proteolytic cleavage site for bo- 
vine factor Xa, between the N-terminal GST sequence and 
the protein to be expressed (lEGR/GIP etc). Insertion of the 
HAO coding region at the Smal site, combined with factor- 
Xa cleavage of the fusion protein, should thus yield the oxi- 
dase with an N-terminal extension of three residues (GIP) 
before Pro 1 of the natural protein. The recombinant plasmid, 
pGHAOX, was introduced into E. coli strain XLl-blue. A 
number of clones were examined for protein expression after 
induction with IPTG. Their total protein content was ana- 
lyzed by SDS/PAGE and Coomassie-blue staining after cell 
lysis. Fig. 2A shows a typical example of a clone overex- 
pressing a 65-kDa protein found only after IPTG treatment 
of the bacterial culture. The protein was absent from the cells 
transformed with the original pGEX-3X, which conversely 
expressed a 26-kDa protein corresponding to the expected 
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Fig. 2. Expression of rat kidney hydroxy acid oxidase in £. coli, 

(A) Total celJ proteins on 12.5% SDS/PAGE. 1-ml cultures were 
harvested before (— ) and after induction with 1 mM IPTG for 
3h; the pellet was dissolved in loading buffer, and one half was 
electrophoresed after boiling. Lane 1, molecular- mass markers; lane 
2, XLl-blue transfected with pGEX-3X; lane 3, XLl-blue trans- 
formed with pGHAOX; lane 4, hydroxy acid oxidase purified from 
rat kidney. (B) Purified fusion protein after chromatography on glu- 
tathione— Sepharose 4B. 



GST. The recombinant fusion protein expressed by cells 
transfected with pGHAOX had the size expected for the 
GST-HAO fusion protein, and it represented about 5% of the 
total protein content. 



Enzymic and physicochemical characterization 
of the GST/HAO chimaera 

The fusion protein was purified by affinity chromatogra- 
phy on glutathione— Sepharose-4B. Cells from an IPTG -in- 
duced culture were sonicated in the presence of 1% Triton 
X-100 and the crude supernatant was adsorbed on the affinity 
colunui. After all the Triton X-100 and most of the protein 
had been washed out from the column, elution was earned 
out with glutathione in the absence of detergent. This indi- 
cated that the fusion protein was soluble. Coomassie-blue 
staining of an SDS/polyacrylamide gel showed a single pro- 
tein band with the expected of 65000 (Fig, 2B). Upon 
chromatography on an AcA34 molecular-sieve colunui (see 
Materials and Methods), the HAO activity peak was eluted 
one fraction volume ahead of the lactate-oxidizing activity 
of flavocytochrome bz^ Since the latter is a tetramer of 
4X57500, this indicated a for the chimaera compatible 
with a tetrameric structure (M, 4 X 65000). The enzyme solu- 
tion was yellow, which indicated the presence of oxidized 
flavin. Its absorption spectrum showed the two characteristic 
flavin peaks, with maxima at 454 nm and 339 nm (Fig. 3). 
Relative to the spectrum of the enzyme purified from rat 
kidney, the latter peak presented a slight blue shift, probably 
due to interference by the tailing of the higher ultraviolet 
peak at 275 nm (arising from the GST chromophores). 

The enzymic parameters of the fusion protein were deter- 
mined for two substrates; D,L-2-hydroxybutyrate and L-man- 
delate. The results are compared in Tabic 1 to those obtained 
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Fig. 5. Absorption spectrum of the purified glutathione S-trans- 
ferase/hydroxy acid oxidase cliimaera (21 enzyme in 50 mM 
TMs/HCl, pH 8.0). The insert shows the visible spectrum at a higher 
sensitivity. 



with the enzyme purified from rat kidney. While there is 
excellent agreement with published values relative to man- 
delate (Urban et al., 1988), small differences were observed 
with a-hydroxybutyrate. The values obtained in this work 
with the rat kidney enzyme are no more different from those 
of Urban et al. (1988) than what can be expected from ex- 
periments using different enzyme batches. However, there is 
a small, but real difference, in the for hydroxybutyrate 
between the fusion protein and the wild-type enzyme. 



Proteolytic cleavage of the fkision protein with factor Xa 
and with trypsin 

When factor Xa was used, from two different commercial 
sources, no matter what the molar ratio of protease/sub- 
strate was (1:100 and 1:10), little proteolysis took place 
(Fig. 4A), After a 24-h digestion, the SDS/polyacrylamide gel 
showed, besides a majority of intact fusion protein, two faint 
bands with apparent M, 43000 and 180(X), similar to those 
obtained with trypsin. It thus appeared that the engineered 
factor-Xa cleavage site (lEGR/GIP) was not accessible to the 
protease in the fusion protein. 

At 0**C with a trypsin/ substrate ratio of 1 :500, the fusion 
protein was rapidly digested and three main bands were ob- 
tained on the SDS/polyacrylamide gel at 43, 26 and 18 kDa 
(the last one called fi; Fig. 4B). Amino acid sequencing of 
the fragments after electroblotting showed fragment to 
correspond to the C-terminal part of HAO, beginning at 
Alal85. The other two fragments presented the N-terminal 
sequence of engineered GST (MSPILGY, etc.). This sug- 
gested that two areas had been cleaved by trypsin in the fu- 
sion protein; the engineered lEGRyOIP sequence between the 
two domains and a region in the HAO sequence also cleaved 
by rat kidney proteases during kidney handling or during 
enzyme purification (Le and Lederer, 1991). 'The relative 
abundance of the higher-molecuiar-mass band suggested that 
the H AO-sensitive region was cleaved first, producing the 
43-kDa fragment (GST plus the 184-residue N-terminal 
HAO fragment a) and an 1 8-kDa fragment (HAO residues 
185 — 352). The 43-kDa fragment could then be cleaved at 
the faclor-Xa cleavage point, yielding protease-resistant GST 
and a protease-sensitive fragment a. Alternatively, after ini- 
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Fig. 4. Proteolysis of the glutathione 5«transferase/hydroxy acid oxidase chimaera (A and B) and of rat kidney hydroxy acid oxidase 

(C). (A) Proteolysis of the fusion protein with factor Xa (protease/substrate molar ratio 1:15; substrate, 25-30 jiM; 50 mM Tris/HCU 
100 mM NaCl, 4 mM CaClj, pH 8.0, 25°C). Lane 1, zero lime; lanes 2 and 3, 30 min and 120 min, factor Xa from Biolabs; lanes 4 and 
5, factor Xa from Sigma Chemical Co (contains bovine serum albumin which gives rise to the second Coomassie-blue-stained band). (B) 
Proteolysis of the fusion protein with trypsin (protease/substrate molar ratio, 1 : 500; substrate, 25-30 nM; 50 mM Tris/HCl, pH 8, 0**C). 
Lane 1, zero time; lane 2, 5 min; lane 3, 2 h. (C) Proteolysis of HAO purified from rat kidney. Same conditions as for (B). 
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Fig. 5. IVyptic cleavage in the protease-sensitive region of HAO 
purified from rat kidney. Arrows indicate the location of the cleav- 
age points as deduced from amino acid sequencing results of frag- 
ment p. 



tial HAO loop cleavage, the GST a fraginent could possibly 
be digested piecemeal yielding GST as a stable end product. 

For verification of the conformational integrity of the 
HAO domain in the fusion protein, the purified rat kidney 
enzyme was submitted to trypsin digestion under conditions 
identical to those used for the chimaera. Both fragments a (a 
doublet at 22 kDa) and P (18kDa) were formed (Fig. 4C), 
confirming that the natural protein contains a protease-sensi- 
tive loop and suggesting that fragment a is more sensitive to 
further digestion by trypsin when it is fused to GST than 
when it is not. The two bands corresponding to fragment a 
had the same N-terminal sequence (PLY, etc.) after electro- 
blotting, showing their difference to lie at their C-termini. 
Amino acid sequencing of fragment yielded two sequences, 
and three in a case of prolonged digestion. The major se- 
quence (ALKEEKP) showed the tryptic cleavage point to lie 
between R184 and A185, as with the fusion protein (Fig. 5). 
The minor sequence, which represented about 10—15% of 
the total amount sequenced, was KEKPTQSV. This can only 
have originated from a protein possessing the VRK insertion, 
which would be cleaved between R and K (Fig. 5). In the 
case of the prolonged digestion, the major sequence was 
identical. The minor sequence was EKPTQ, etc.; it could 
have arisen from a cleavage after the lysine of the VRK in- 



sertion (Fig. 5), but the idea of a less probable but not impos- 
sible tryptic cleavage between El 88 and El 89 of the main 
sequence cannot be totally ruled out (Haumont et al., 1989 
and references therein). It can be added that enzymic activity 
monitoring during proteolysis showed HAO activity to de- 
crease by about 15% over the first 100 min, after which it 
was as stable as that of the control- This suggested that pro- 
teolytic cleavage had only a slight effect on the HAO enzy- 
mic activity. 

DISCUSSION 

Using the information given by the amino acid sequence 
of ct-hydroxy acid oxidase, we used PGR for amplifying, 
cloning and sequencing three overlapping fragments from rat 
kidney cDNA, namely a central coding segment, a 5' frag- 
ment and a longer 3' end. Several independent clones were 
sequenced each time in order to detect possible copying er- 
rors of Taq DNA polymerase. Most of the clones gave the 
sequence shown in Fig. S2, yielding an open reading frame 
identical to the amino acid sequence previously determined. 
However, three out of eight clones of the 3'-fragment carried 
the same nine-base insertion in the open reading frame, 
which would correspond to the insertion of amino acids VRK 
between El 88 and El 89 of the determined amino acid 
sequence. This cannot be ascribed to a copying error. The 
three-residue insertion occurs in a region which is sensitive 
to proteases, as shown in this and previous work (Le and 
Lederer, 1991), and corresponds to disordered loops in the 
three-dimensional structures of the homologous enzymes fla- 
vocytochrome Z?2 (Xia and Mathews, 1990) and glycolate 
oxidase (Lindqvist, 1989). Thus, this region could probably 
accommodate a few more residues without alteration of the 
overall structure. No other sequence difference was detected 
in the cDNA, apart from an extension of the poly(A) tail. 
These results suggested the possible existence of a-hydroxy 
acid oxidase isozymes. While no evidence for amino acid 
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sequence heterogeneity was obtained before (L^ and Lederer, 
1991), during the present work the trj^tic cleavage in the 
loop region of kidney HAO followed by automatic £dman 
degradation provided evidence for the existence of a second 
sequence compatible with the insertion of the tiipeptide 
VRK. The corresponding isoform was calculated to represent 
about 10% of the total protein submitted to degradation. In 
the past, McGroarty et al. (1974) and Cromartie and Walsh 
(1975a) noted the presence of one major and 1 ~3 additional 
faint bands staining for HAO activity on non-denaturing gels. 
The molecular nature of these putative isozymes was not 
elucidated. In contrast, Duley and Holmes (1974) detected 
only one activity band on starch gels. Thus, all the evidence 
obtained to date suggests that a second form of HAO, if 
present, must be minor. 

Alternative splicing of the primary transcript from a sin- 
gle gene could possibly explain the generation of these two 
HAO isoforms, given the sequence identity for the untrans- 
lated regions of their mature mRNA. The existence of the 
minor isoform at the protein level is intriguing from an evo- 
lutionary point of view. One of the major differences between 
the various members of the FMN-dependent a-hydroxy-acid- 
oxidizing enzymes lies in the length of the polypeptide chain 
which loops out of the ^^.a^ barrel between strand and 
helix 04 (Lie and Lederer, 1991). It is in this segment that the 
structurally disordered region is found, as weU as the pro- 
tease-sensitive bonds and the tripeptide insertion. It is so far 
not known whether this region which loops out of the bar- 
rel plays an important role in the structure or function of the 
enzyme. It was, however, found that proteolytic cleavage of 
a single peptide bond in flavocytochrome £>2 led to alterations 
in a number of kinetic parameters {k^, K^^ K, for inhibitors), 
suggesting some kind of interaction, direct or indirect, be- 
tween the active site and the disordered region, either before 
or after cleavage (Ghrir and Lederer, 1981). Moreover, the 
folding of the extension between p4 and 04, with respect to 
the barrel, is very different in glycolate oxidase (Lindqvist, 
1989) and in flavocytochrome i?2 (Xia and Mathews, 1990), 
probably due to the presence of the heme-binding domain. 
Thus, this region of the peptide chain may be a hot spot for 
evolutionary changes in the protein family, and it is tempting 
to speculate that alternative splicing might be a molecular 
mechanism favouring such changes (Craik et al., 1983). The 
data presented in this paper do not exclude, however, the 
existence of closely related genes as an origin of the iso- 
forms. An allelic form of the gene is unlikely because the 
mRNA used for amplification was extracted from a single 
homozygous Wistar rat. It is to be noted that a single cDNA 
sequence was identified for the other known members of the 
family of hydroxy-acid-oxidizing enzymes. 

We used the radiolabeled amplified central fragment for 
characterising HAO mRNA by Northern blotting. Two mes- 
sengers of equal abundance and differing size (about 1800 bp 
and 2(XX) bp) were detected in rat kidney poly(A)-rich RNA. 
They were also detected in total kidney RNA; their ex- 
pression was tissue specific, at least at the sensitivity level 
of the Northern-blot technique. This tissue specificity is in 
keeping with previous observations (McGroarty et al., 1974; 
Zaar and Fahimi, 1991). The origin of the difference between 
the two messengers remains to be elucidated, since their 
mass difference cannot be explained by the sequence differ- 
ences detected in PCR products. 

Hydroxy acid oxidase was expressed as a fusion protein 
with glutathione 5-transferase using the vector pGEX-3X. 
After induction with IPTG, up to 5% of the proteins were 



constituted by the chimaera, expressed as soluble protein. 
The fusion protein was readily purified by affinity chroma- 
tography on inunobilised glutathione. That the folding of the 
chimaeric HAO should be similar if not identical to the 
native HAO is shown by its flavin spectrum, its enzymic 
activity and its similar susceptibility to limited proteolysis by 
Urypsin between positions 184 and 185. The enzymic param- 
eters determined for the substrate L-mandelate are identical 
to those obtained with the enzyme purified from rat kidney. 
The small difference found for the AT^ of D,L-a-hydroxybu- 
tyrate is somewhat surprising, as one would expect a struc- 
tural perturbation due to the fusion to have effects on both 
substrates. Moreover, it can hardly arise from the presence 
of the second HAO form in the natural enzyme, since this 
form is a minor constituent of the purified natural enzyme. 
Whatever the case, even though we could not obtain intact 
recombinant HAO due to the resistance of the chimaera to 
proteolysis, our results open the way to the study of structure/ 
function relationships with mutants whose properties should 
be compared to those of the wild-type fusion protein. 

Manunalian GST enzymes are dimeric (Mannervick and 
Danielson, 1988), but the enzyme from Schistosoma japoni- 
cum is believed to be a monomer (Tiu et al., 1988; Farmer 
et al., 1992). Hydroxy acid oxidase itself is a tetramer (Crom- 
artie and Walsh, 1975b; Duley and Holmes, 1976). All the 
well characterized homologous enzymes were found to be 
either tetramers or octamers (Monteilhet and Risler, 1970; 
Phillips et al., 1976; Sullivan et al., 1977). Furthermore, a 
monomeric form of Hansenula anomala flavocytochrome 
obtained at low ionic strength was found inactive, even 
though it still retained both the flavin and the heme (Baudras, 
1972); activity was recovered together with the tetrameric 
state upon raising the ionic strength. No evidence exists for 
an active monomeric state for any of the other members of 
the family. Furthermore, the crystal structures of glycolate 
oxidase and flavocytochrome show intersubunit contacts 
to be extensive (Lindqvist, 1989; Xia and Mathews, 1990; 
Lindqvist et al., 1991). The GST-HAO chimaera was thus 
expected, as found, to have a GST domain flanking each 
subunit of the tetramer. The three-dimensional structures of 
the homologous enzymes flavocytochrome b^. and glycolate 
oxidase (Lindqvist et al., 1991) show an identical fold start- 
ing at a position equivalent to mature HAO Leu2 (a turn 
followed by a helix). A few residues in. this area, at the sur- 
face of the subunit, are located close to fourfold contacts in 
the two structures. It would be surprising to find a very dif- 
ferent topology at the N-terminus of HAO, in view of its 
degree of similarity with the two other enzymes. This local 
structiwe, therefore, as well as the overall architecture of the 
tetramer, appear compatible with the formation of a more 
complex and active molecular edifice, which retains affinity 
for glutathione and has both flavin-binding capacity and 
hydroxy acid oxidizing activity, even though the fusion site 
is inaccessible to factor Xa. It is interesting to note that, 
while there is a rapid increase in the number of described 
cases of recombinant proteins or domains expressed as fusion 
proteins with GST, the number of active enzymes expressed 
in this way is still rather limited. Furthermore, to our knowl- 
edge, the present case represents the first example of an ac- 
tive multimeric enzyme fiised to GST. 

A. Belmouden was a recipient of a fellowship from the Centre 
National de la Recherche Scientifique (bourse de doctorat pour in- 
g^nieurs). The authors are indebted to D. Mazeila for competent 
secreterial help. 
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CACACCrcCCCAATCAACCACTCTACCCCCCTrrcCATTGCTCCCTCTACCTACACCT -145 
A7TACTGGIXATTCf»AGACCTTACA<nXA<»ATCTTCTTGTCTAAGACCCCT6AATGCTCTCA -73 



Rat HAOX cDNA 



I nternal cDWA fragme nt 



(«IC)15 



X6— ► 
X7-» 



(ANC) 



Fig. SI. Strategy for amplifying and cloning the cDNA of rat 
kidney hydroxy acid oxidase. The exact position in the final se- 
quence and the sequence of the synthetic oligonucleotides XI —X7 
and ANC are given in Table SI. The sequences of X3 and X4 and 
of X5 to X7 do not overlap (see Table SI) although the arrows 
symbolizing these primers are overlapping. 



Table SI. Sequence and position of the primers used in the PGR 
amplification steps. XI, X2 and X7 are degenerate oligonucleotid- 
es. 



Primer 


Nucleotide 
position 


Sequence 


XI 


64- 80 


TGGGATTTTATTGAAGG 
C C C G 
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X2 
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ACCATTTGTTTATTAAAATCCCA 
C C G G 6 
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TGACATATCTCTCAGGTATC 
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TGGAA6ATATTGTTGCTGCTG 
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GTTGGTTCCAACTCTACAT 
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GTCAGACTGGGATTTCAACAA 
C T T 


ANC 




GACTCGAGTCGACATCG 



EcoOIOS Nco I 



Hind III 



n I I 



100 bp 

Fig. S3. Restriction map and sequencing strategy for the 3'-end 
of HAO cDNA Gower mass 1.1-kb fragment amplified from ss 
cDNA with X7 and ANC, see text). Restriction enzymes and their 
sites used to generate small fragments are shown above the thick 
bar representing the amplified segment. The lines below the bar 
represent the sequenced length of fragments obtained after subclon- 
ing (•) or after exonuclease-m deletion (X) (Henikoff, 1984). The 
higher-mass 1.1 -kb fragment described in the text was sequenced 
entirely using the exonuclease-IU deletion method. Arrows represent 
the direction of the sequencing. 
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ATT ACT CCA GAC CTC ATT CAC TTC TCC AGA TTA TAACGACCTACTGACATCCCTACAACA 1086 
GGAAGACAAGACTTCAACATACTCTCTGAGCCTATTCTTCTTCCCTCCCATCATACCTACTACTTTCAGCCC 1158 
TCTACCTTCA6AAATCCAGATCGATCAAGAAAACATACCTAACACCTACCAGACCCGTCCATTTGCATCAAC 1230 
GAATAACATCTAATCTTCTACAGGATAACTATAACTGACAATTAATTGACTATCTCCAATAGCCATGTTCCA 1 302 
GAAAGAAAGGAAATACTTACATCTCCTGTGACATCCCAATTACTTTCATTrGATTATTATGTACACTATACA 1 374 
TTATTAAAACGTCATACTOTGCCATAATTATGTCCAATTACTATGTATCAATTAAAACAAC AATATATATTT 144 6 

Fig. S2. Composite nucleotide sequence, and deduced amino acid 
sequence, of the cDNA of ot-hydroxy acid oxidase from rat kid- 
ney. The boxed sequence is that obtained after the first PCR amplifi- 
cation with XI and X2 (Table SI). The peroxisomal targeting se- 
quence is underlined. The arrow indicates the position of the nine 
nucleotide insertion (TGAGAAAGG) found in some clones (see 
text). 
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The first nucleotide binding fold (NBFl) of the cystic 
fibrosis transmembrane conductance regulator (CFTR) 
and its disease-causing mutant form (AF508,NBF1) 
were overexpressed in high yield in Escherichia coli 
in Aision with the maltose- binding protein (MBP). The 
rationale for producing the chimerae was to aid in 
domain purification, solubilization, and crystallization 
and to examine the effect of protein-protein interac- 
tions on the properties of the mutcmt NBFl. 

Both the purified wild type and AF508 mutant fusion 
proteins fold into functional nucleotide binding do- 
mains as determined by using the fluorescent nucleo- 
tide analog TNP-ATP (2'-(3')-0-(2.4,6-trinitro- 
phenyl)adenosine-5' -triphosphate). Moreover, the 
prominent secondary structural features of the two 
proteins as assessed by ultraviolet circular dichroism 
spectropolarimetry are very similar, as is the higher 
order structure evident in three separate protease 
digestion patterns. Finally, the stability of the nucleo- 
tide binding function of the two proteins is similar as 
assessed by sensitivity to urea. 

Gel filtration chromatography and electron and con- 
focal microscopy reveal that both fusion proteins, but 
not MBP alone, form organized fibers, suggesting that 
NBFl self-associates, thus raising the possibility that 
CFTR may be oligomeric in the plasma membrane. 
Significantly, in the presence of high salt, these fusion 
proteins also have a propensity to form microcrystals. 
Finally, the two separate domains (NBFl and MBP) 
constituting the fusion proteins appear to interact quite 
strongly as both proteins remain associated even after 
cleavage of their fusion junction. 

The possible relevance of these novel findings to 
those approaches that might be taken to elucidate the 
three-dimensional structural differences between the 
wild type and AF508 mutant forms of CFTR, as well 
as to ameliorate the severity of cystic fibrosis, is dis- 
cussed. 
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The most common lethal autosomal recessive genetic dis- 
ease in the Caucasian population is cystic fibrosis (CF).^ The 
disease results from mutations in the gene that codes for the 
cystic fibrosis transmembrane conductance regulator (CFTR) 
protein comprising 1480 amino acid residues (1-3). Predic- 
tions suggest that CFTR embraces two transmembrane do- 
mains, each having six putative membrane-spanning seg- 
ments; two nucleotide binding folds, NBFl and NBF2, each 
possessing a Walker A and B consensus secjuence (4); and a 
regulatory (R) domain having numerous charged amino acid 
residues, which can be phosphorylated by some cAMP-de- 
pendent protein kinases (5, 6). 

The majority of the CFTR mutations occur within or near 
the two nucleotide binding folds. Significantly, 92% of all 
patients with cystic fibrosis have at least one allele in which 
the codon for phenylalanine 508 in NBFl is deleted (7-9). 
Strong evidence has established CFTR as a chloride ion 
channel regulated by cAMP-dependent phosphorylations (10- 
16). Moreover, recent studies indicate that interaction of 
intracellular ATP with NBFl (or NBF2) may be essential for 
chloride channel activation (17, 18). Hence, the roles of the 
NBFs may be of great importance in elucidating the molecular 
basis of CF. 

A synthetic peptide embracing the Walker A consensus 
sequence and the region surrounding Phe-508 contains less 
/9-sheet structure in the AF508 mutant than in the wild type 
peptide (19, 20). Significantly, the peptide studies demon- 
strated also that this structural change is due to destabiliza- 
tion of the mutant peptide structure, indicating that the 
AF508 mutation results in a protein folding defect (19-21). 
Recent in vivo studies support this finding, as lowering the 
temperature of cells harboring the AF508 mutant protein 
increases the amount of functional protein at the plasma 
membrane (22, 23). Despite the importance of this observa- 
tion, the functional mutant protein exhibits reduced open 
channel probabilities relative to the wild type protein (16, 22). 
The structural basis of the altered channel function of the 
mutant protein in the membrane is currently unknown. 
Therefore, the production of both purified wild type and 
AF508 mutant forms of NBFl in a state amenable to bio- 
chemical and structural studies is necessary to define the 

* The abbreviations used are: CF, cystic fibrosis; CFTR, cystic 
fibrosis trsuismembrane conductance regulator; NBF, nucleotide 
binding fold; TNP-ATP, 2'-(3')-0-(2,4,6-trinitrophenyl)adenosine- 
5 '-triphosphate; HPLC, high performance liquid chromatography; 
PAGE, polyacrylamide gel electrophoresis; X-gal, 5-bromo-4-chloro- 
3-indolyl-0-galactoside; AF508, deletion of phenylalanine 508 from 
CFTR; MBP, maltose-binding protein; PGR, polymerase chain re- 
action; MDR, multidrug- resistant P-glycoprotein. 
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structural change in the mutant protein that leads to altered 
function. 

With the above thoughts in mind, we rationalized that 
perhaps fusion of NBFl and its AF508 mutant form with a 
highly soluble protein might facilitate purification and struc* 
tural characterization. Precedence for this approach derives 
from the recent successful elucidation of the three-dimen- 
sional structure of actin by first forming a complex with 
DNase 1 (24), In the studies reported in detail below, we chose 
to fuse both the wild type and AF508 mutant NBFl domains 
with the maltose-binding protein, a soluble protein for which 
a high resolution crystal structure has already been deter- 
mined (25). 

EXPERIMENTAL PROCEDURES 

Materials 

Clones T16-1 and Cl-1/5 were obtained from American Type 
Culture Collection (ATCC). Primers used in the PCR procedures 
were sjmthesized in the Johns Hopkins University Protein/Pep tide/ 
DNA Synthesis Facility using an Applied Biosystems model 380B 
synthesizer. The pMAL-cRl expression vector, the purification sys- 
tem. Factor Xa, restriction enzymes, and T4 DNA ligase were ob- 
tained from New England Bio-Labs. TNP nucleotides were obtained 
from Molecular Probes. Pfu DNA polymerase and the Plasmid Quick 
DNA Purification Kit were purchased from Stratagene. The DNA 
sequencing kit (Sequenase version 2.0 DNA) was from U. S. Biochem- 
ical Corp., and the maltose -binding protein (MBP) from New England 
Bio-Labs. Waters-Millipore supplied the Protein Pak 300 SW HPLC 
gel filtration column (7.5 mm inner diameter x 30 cm length), and 
uranyl acetate was purchased from Ted Pella. All other reagents were 
of the highest quality commercially obtainable. 

Methods 

Construction of the Expression Vector Containing CFTR NBFl— 
An outline of the approach used is summarized in Fig. 1. The 
expression cassette polymerase chain reaction (EC-PCR) (26, 27) was 
employed to synthesize the NBFl of CFTR. Restriction enzyme sites 
{Bglll and Hindlll), GO clamps, and a stop codon were incorporated 
into the primers, including a Bglll site in the forward primer and a 
Hindlll site in the reverse primer. The sequence of the primers used 
to defme and amplify NBFl from Phe-433 (base 1429) to Ser-589 
(base 1899) are as follows. 

Forward primer: S'-CGCGCAGATCTTTCTCACTTCT 
( 35 bases > TGGTACTCCTGTC - 3' 




Fig. 1. Con struction of the expression vector containing 
NBFl of CFTR. The cDNA of NBFl or AF608,NBF1 is ligated into 
BamHI/Z/tridlll sites of pMAL-cRl in frame with the MBP (malE 
gene product). The insertion of the NBFl cDNA into this plasmid 
results in disruption of the lacZa gene and allows blue to white colony 
selection when bacterial cells are grown on plates containing IPTG 
and X-gal. The pMAL-cRl is 6133 base pairs long and the NBFl and 
iu mutant are, respectively, 471 and 468 base pairs long. For experi- 
mental details, see ''Methods." 
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Reverse primer: S'-GCGCAAGCTTTTAGCTTTCaA 
( 36 ba s es ) ATATrTCTTTTTCTG - 3 ' 

The expression cassette PCR was carried out in a lOO-n\ reaction 
volume composed of 20 mM Tris-Cl, pH 8.8. 2.5 units of Pfu polym- 
erase. 0.5 mM primers, 50 ng of template cDNA, 200 mM dNTPs, 1.5 
mM MgCU. 10 mM KCl, 6 mM (NH4)2S04, and 0.1% Triton X-100. 
The conditions for each PCR cycle was as follows: denaturation at 
94 "C for 43 s, annealing at 43 'C for 1 min, and elongation at 73 •C 
for 1 min and 15 s. 

Clones T16-1 and CM/5 were used as templates to amplify the 
wild type NBFl and the mutant AF508,NBFl, respectively. These 
template plasmids for the PCR were prepared by use of Stratagene*s 
Plasmid Quick Midi Column. PCR amplified products (~468-471 
base pairs expected) were purified by use of Bio-Rad's Prep-A-Gene 
DNA Purification Matrix kit and were subsequently digested with 
Bg£ll and Hindlll restriction enzjrmes. The digested PCR products 
were then subjected to agarose gel electrophoresis, further purified by 
use of Bio>Rad's Prep-A-Gene DNA Purification Matrix kit, and were 
finally ligated into the BamHI/Z/indlll-cut pMAL-cRl expression 
vector (28, 29) in frame with the MBP. The DNA sequence encoding 
for the protein sequence -I-E-G-R-, which is recognized by the pro- 
tease called Factor Xa, is present between the DNA sequences for 
MBP and NBFl. The insertion of NBFl into the vector disrupts the 
expression of lacZa DNA and provides a rapid selection method for 
the correct recombinant. Thus, white colonies instead of blue colonies 
are formed on X-gal in an a-complementing host such as TBI (30) 
or JM107 (31). 

Competent TBI Escherichia coU were transformed with the above 
ligation mixture and grown on LB/agar/ampicillin plates. Plasmids 
purified from the ampicillin-resistant white colonies were sequenced 
by Sanger^s method (32, 33) to confirm the fidelity of the PCR step 
and formation of the proper construct. All other molecular biological 
methods employed were conducted by standard procedures (34). 

Overexpression and Purification of MBP-NBFl Fusion Proteins — 
TBI E. coli harboring the appropriate recombinant plasmids were 
grown at 37 'C in LB media containing 0.2% glucose and ampicillin 
(0.1 mg/ml) until the absorbance at 6(X) nm reached about 0.5 units 
(—2 X 10*^ cells/ml). Cells were then induced with 0.3 mM isopropyl- 
l-thio-/3-D-galactopyrano8ide (IPTG). Incubation was continued for 
an additional 3 h after which the cells were harvested by centrifuga- 
tion at 4000 X g for 20 min. A combination of lysozyme and sodium 
deoxycholate was used to lyse the cells (35, 36). The lysate was 
centrifuged to remove the broken cell walls and debris (Beckman 
Ultracentrifuge L7-55; SW 28 rotor; 4 "C, 28,000 rpm, for 1 h). The 
supernatent was then incubated with 55% saturated ammonium 
sulfate at 4 'C for 30 min, and centrifuged at 12,000 x g for 40 min 
at 4 'C in a Sorvall model RC 2B centrifuge equipped with a SS 34 
rotor (radius = 4.25 inches). The pellet was resuspended in 40 ml of 
the column buffer (10 mM sodium phosphate, 0.5 M NaCI, and 1.0 
mM EGTA, pH 7.2) to give a protein concentration of ~2.5 mg/ml. 

Purification was carried out by loading the resuspended sample 
onto a column containing amylose resin that had been previously 
equilibrated with the column buffer. The binding capacity of the 
column is 2 mg of fusion protein/ml of bed volume. The column was 
washed with three column volumes of column buffer containing 0.25% 
Tween 20 and finally with column buffer without the surfactont. 
Elution of the bound fusion protein was accomplished at a flow rate 
of 1 ml/min with 20 mM maltose dissolved in the column buffer. 
Fractions containing the fusion proteins, detected by UV absorbance 
at 280 nm or by Coomassie dye binding, were pooled and concentrated 
using Amicon's Centri-Prep 30 concentrator. 

SDS'PAGE was carried out as described by Laemmli (37). Crels 
were stained with Coomassie dye to determine the degree of overex- 
pression of the recombinant protein and to estimate its purity. Gas 
phase sequencing of the N terminus (38, 39) of the purified protein, 
as well as an amino acid analysis, established that the product isolated 
was the desired fusion protein. 

TNP- ATP Binding— Fluorescence enhancement of the ATP ana- 
log TNP-ATP (40-42) was used to study the nucleotide binding 
characteristics of the fusion proteins in the presence or absence of 
urea. The procedure used, which includes correcting for the inner 
filter effect on the fiuorescence of the TNP-ATP probe, has been 
described previously (19). Samples (4 mM protein in 20 mM HEPES, 
pH 7.4, at concentrations of TNP-ATP indicated) were excited at 
410 nm (slit width, 1 mm). The emission was then measured at 550 
nm (slit width, 2 mm) using an Aminco model SPF-125 spectrofluo- 
rometer. Finally. ATP was added after TNP-ATP binding to dem- 
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onstrate that ATP could displace bound TNP-ATP. 

Protease Susceptibility — To the purified fusion proteins at concen- 
trations of 60-70 nM in a 20 mM HEPES buffer, pH 7.4, proteases 
(trypsin, endoproteinase Glu-C, or endoprotease Arg-C) were added 
at 1% (w/w) in the presence or absence of 12.5 mM ATP plus 42 niM 
maltose. The mixtures were incubated at 23 or 37 *C for times up to 
90 min. AHquots of the digests were taken at varying times, subjected 
to SDS-PAGE, and then visualized after staining with Coomassie 
Blue dye. 

Circular Dichroism Spectroscopy (CD) — To assess the secondary 
structures of the fusion proteins, CD spectra were obtained at 23.2 *C 
in a 2-mm path length cuvette using an AVIV spectropolarimeter. 
Wild type and mutant fusion proteins were present at a concentration 
of 1.3 fiM in a 0.4 -ml system containing 10 mM Tris-Cl buffer, pH 
7.6. Details of the procedure have been published previously (20). 

Gel Filtration HPLC — Fusion proteins (175 /ig in 1 ml) were loaded 
onto a Protein Pak 300 SW filtration column that had been equili- 
brated previously with 0.2 M HEPES at pH 7.4. Eluiion profiles were 
monitored at 280 nm by an isocratic gradient using a flow rate of 0.5 
ml/min. Molecular size markers used for gel filtration chromatogra- 
phy were blue dextran, /9>amylase, alcohol dehydrogenase, bovine 
serum albumin, and MBP. corresponding to 2000. 200, 150, 66, and 
42 kDa, respectively. 

Amino Acid and N-terminal Sequence Analysis — For amino acid 
analysis, the fusion protein (1280 pmol of the wild type or 920 pmol 
of the dF508 mutant) was vapor-phase- hydrolyzed in 6 N HCl at 
110 *C for 24 h. The resulting hydrolyzed amino acid residues were 
derivatized with phenyl isothiocyanate, and the derivatives were 
separated by reverse phase chromatography and subsequently ana- 
lyzed by PICO-TAG amino acid analysis system software. 

For the N-terminal sequencing of the fusion protein, 100-120 pmol 
were electroblotted from an SDS gel onto a polyvinylidene difluoride 
membrane. The putative 60-kDa molecular mass was excised and 
subjected to Edman degradation chemistry (38, 39) using an Applied 
Biosystems model 475A protein sequencing system, which employed 
both a gas and pulsed liquid phase^ 

Electron Microscopy — Parloidion-coated 200- mesh copper grids 
were stabilized with carbon and glow discharged before applying 1-2 
fiM of the fusion proteins in 20 mM HEPES buffer at pH 7.4. The 
protein samples in a 30-pl vo|uine were adsorbed onto the pretieated 
grid for 30 s. The grid was tken washed successively three times in 
drops of 20 mM HEPES. Finally, the treated grid was floatied on a 
drop of filtered 1% aqueous uranyl acetate for 1 min. The excess 
solution was blotted with Whatman no, 2 filter paper. The negatively 
stained sample on the grid was then examined with a Zeiss transmis- 
sion electron microscope (model lOA) at an acceleration voltage of 
60 kV, The ma^ification was calibrated using an appropriate dif- 
fraction grating replica. For a qualitative determination of the dimen- 
sions of the protein fibers, micrographs were printed at a magnifica- 
tion of 36,000 or 98,000. A Photomicrograph Scale Marker** was used 
to draw scale marks on the print of each electron micrograph. 

Confocal Microscopy — Confocal microscopy of the fusion proteins 
(5 MM) in a volume of 0.5 ml in 20 mM HEPES, pH 7.4, was performed 
at 23 'C in the presence of 20 TNP-ATP. Following an incubation 
of 10 min, dialysis of the sample against 10 mM Tris-Cl at pH 8.0 
(two changes with 300 ml at 35-min intervals) was conducted to 
remove the unbound TNP-ATP. The samples were added to an equal 
volume of 0.1% p-phenylenediamine in a 1:9 (v/v) phosphate- buffered 
saline solution in glycerol. Cover glasses were placed over specimens 
of the fusion protein samples on microscope slides and the edges 
sealed with a cellulose acetate polymer solution to make them ready 
for examination with a Bio-Rad MRC-600 laser scanning confocal 
imaging system. Images of the emission fluorescence at 520 nm of the 
TNP-ATP that was bound to the protein were obtained. For the 
three-dimensional computer-assisted images, two identical sets of 
optical sections were projected in the z axis, pixel-shifted, and super- 
imposed to create the resulting stereo images. 

Cry^ia/Ziiaf ion— Microcrystals were obtained using vapor diffusion 
through hanging drops (43). Fusion proteins at a concentration of 3 
mg/ml in a volume of 48 m1 were incubated with 42 fiM TNP-ATP for 
30 min at 23 *C. Following this incubation, a 2-^1 aliquot from the 
above reaction mixture was combined with 2 ^1 of the precipitating 
solution containing 0.2 M lithium sulfate, 30% polyethylene glycol 
4000, and 0.1 M Tris-Cl, pH 9,0. 

Cleavage of the MBP-NBFJ Fusion Protein— The MBP NBFl 
fusion protein was treated with 0.08% SDS, incubated at 23 *C for 16 
h, and then subjected to the action of Factor Xa at a protease to 



fusion protein ratio of 1:1(X) for 14 h at 4 *C. The degree of cleavage 
was monitored by SDS-PAGE. 

Determination of Protein — Protein was estimated either by the 
method of Bradford (44) using the Coomassie protein assay reagent 
kit from Pierce, or where indicated from amino acid compositional 
analysis. 

RESULTS 

Overexpression and Purification of MBP-NBFl and MBP- 
(£!kF508)NBF2 — The correct orientation and cDNA sequence 
of CFTR nucleotide binding fold 1 and its AF508 mutant in 
the pMAL-cRl expression vector in frame with the MBP was 
first confirmed by sequencing the recombinant plasmids (see 
"Methods"). The entire codon for phenylalanine in the mu- 
tant was absent. As shown in Fig. 2, the overexpression of 
fusion proteins under the control of the tac promoter after 
induction with IPTG is strikingly evident when the induced 
expression level is compared with the uninduced cell lysate. 
As the MBP exhibits a high affinity for amylose, the purifi- 
cation of the fusion proteins by amylose column affinity 
chromatography was simple and rapid. A yield of approxi- 
mately 25 mg of purified proteins per liter of cell culture was 
achieved. The resultant proteins appear >95% pure as judged 
by SDS-PAGE (Fig. 2) and have the expected apparent mo- 
lecular masses of 60 kDa (42 kDa for MBP + 18 kDa for 
NBFl). Moreover, amino acid analysis gave experimental to 
theoretical ratios very close to I in most cases (Table I). 
Significantly, the ratio of picomoles of phenylalanine to the 
number of picomoles of other amino acids, obtained from 
amino acid analysis, is consistently lower for the AF608 
mutant fusion protein than for the wild type, confirming that 
the number of phenylalanine residues is less in the mutant. 
Finally, in experiments not presented here, N-terminal se- 
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Fig. 2. Analysis by SDS-PAGE of overexpressed and puri- 
fied wild type and mutant MBP-NBFl fusion proteins. Left 
panel, to demonstrate the overexpression of the MBP-NBFl fusion 
proteins the whole cell lysate (300 fil, 2 x 10^ cells/ml) before IPTG 
induction was solubilized in SDS sample buffer and loaded onto the 
gel designated as "uninduced cells** (UI), SDS-PAGE was then car- 
ried out by the method of Laemmli (34). The E. coli cell lysate (75 m1> 
stationary phase cells) after IPTG induction was treated in the same 
manner and designated "induced cells'* (/). Molecular size markers 
are, from top to bottom^ phosphorylase b (97.4 kDa), bovine albumin 
(66.2 kDa), ovalbumin (45 kDa), carbonic anhydrase (31 kDa), trypsin 
inhibitor (21.5 kDa), and lysozyme (14.4 kDa). Right panels the fusion 
proteins (25 Mg) purified by amylose affinity chromatography were 
subjected to SDS-PAGE. Both proteins exhibit an approximate mo- 
lecular mass of 60 kDa. 
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quence analysis (10 amino acids) also confirmed the identity 
and purity of the fusion proteins. 

TNP'ATP Binding Characteristics^TNP- ATP was found 
to bind similarly to both the wild type and mutant fusion 
proteins as depicted in Fig, 3. ATP effectively displaces all of 
the bound TNP-ATP, indicating that the site involved is 
capable of binding the physiological substrate. The apparent 
Kd values determined according to a method previously re- 
ported (45) for binding of TNP-ATP to the wild type and 
mutant fusion proteins are 1.8 and 2.3 mM» respectively. The 
corrected Kd values for ATP binding, determined by compe- 
tition analysis between TNP-ATP and ATP, are 1.8 and 1.9 
mM, respectively. The stoichiometry of TNP-ATP binding is 
approximately 1 for both types of fusion proteins under the 
conditions employed. In the control experiment, MBP alone 
did not bind TNP-ATP even at concentrations exceeding 5 

Table I 

Amino acid analysis of CFTR wild type and AF508 MBP-NBFl 
fusion proteins 

Analyses were carried out as described under **Methods" on fusion 
proteins that had been overexpressed in E. coU and puriHed by 
amylose column affinity chromatography, also as described under 
"Methods." The mol ratios of the calculated amino acid residues to 
the theoretical values are based on 48 mol of Leu/mol of fusion 
protein. A yield of 100% was assumed for the leucine residues. 



Amino 
acid* 
(X) 


Wild type 
(experimental/ 
theoretical) 


Mol ratio 
(Phe/X) 


Mutant 
(experimental/ 
theoretical) 


Mol ratio 
(Phe/X) 


Ala 


1.02 


0.448 


1.01 


0.433 


Arg 


1.20 


1.444 


1.20 


1.382 


Asx* 


1.16 


0.366 


1.15 


0.356 


Cys 


0.85 


14.280 


0.83 


14.006 


Gly 


0.96 


0.636 


0.96 


0.515 


Glx^ 


1.12 


0.393 


1.12 


0.379 


His 


1.63 


3.735 


1.62 


3.588 


He 


0.84 


0.739 


0.85 


0.701 


Leu 


1.00 


0.506 


1.00 


0.484 


Lys 


1.06 


0.486 


1.06 


0.467 


Met 


1.33 


1.657 


1.37 


1.547 


Phe 


1.01 


1.000 


1.01 


1.000 


Pro 


0.94 


0.994 


0.93 


0.962 


Ser 


0.99 


0.724 


0.99 


0.691 


Thr 


0.92 


0.980 


0.89 


0.966 


Tyr 


1.01 


1.155 


1.03 


1.076 


Val 


0.98 


0.850 


1.02 


0.783 



" Tryptophan is normally degraded and not detected. 
^ Asx» aspartic acid + aaparagine. 
' Glx, glutamic acid + glutamine. 



mM. This important control emphasizes that the nucleotide 
binding properties of the fusion proteins, as expected, are due 
to interaction of TNP-ATP or ATP with NBFl or 
AF608,NBF1 per se and not with the MBP. 

Secondary Structure as Assessed by Circular Dichroism 
Spectroscopy — The overall secondary structures of the wild 
type and AF508 mutant fusion proteins were estimated by 
CD spectropolwimetry. As shown in Fig. 4, the resultant 
spectra, which summarizes the mean residue molar ellipticity 
as a function of wavelength, reveals no obvious differences 
between the wild type and AF508 mutant fusion proteins. In 
both cases, the mean residue molar ellipticity of the fusion 
proteins exhibits a maximum at 197 nm and two minima, one 
at 209 nm and the other at 223 nm. Deconvolution of the 
spectra using the Prosec program (46) predicts that both wild 
type and AF508 mutant fusion proteins contain nearly iden- 
tical secondary structural elements distributed as approxi- 
mately 40% a-helix, 24% jS-strand, 13% turns, and 24% ran- 
dom coil. Thus, both fusion proteins are highly structured as 
assessed by this technique. 

Protease Susceptibility — To test for subtle conformational 
differences between the wild type and the AF508 mutant, both 
fusion proteins were incubated with three different proteases 
(trypsin, endoproteinase Glu-C, and endoproteinase Arg-C), 
and their susceptibility to proteolysis was analyzed by 10% 
SDS-PAGE. Trypsin in the absence (Fig, 6A), and in the 
presence (Fig. bB) of 12.5 mM ATP plus 42 mM maltose 
produces essentially identical digestion patterns for the two 
fusion proteins. ATP plus maltose seems to have no effect in 
protecting the proteins from trypsin digestion (Fig. bB). Fur- 
thermore, the kinetics of the degradation of the two fusion 
proteins by trypsin is identical. Trypsin was the most effective 
in 3-min digestion, where the starting proteins were mostly 
degraded. Endoproteinase Glu-C required about 20 min to 
degrade the fusion proteins to a similar degree (Fig. 5C). 
Endoproteinase Arg-C was found to be the least effective (Fig. 
5D). Like that of trypsin, the digestion patterns of the endo- 
proteinases Glu-C and Arg-C are very similar for both wild 
type and mutant fusion proteins. 

Susceptibility to Denaturation by Urea — Previous studies 
with synthetic peptides of NBFl have shown that loss of the 
TNP-ATP binding function in the presence of the denaturant 
urea correlates with loss of secondary structure (19. 20). This 
simple test has been used here to assess the relative stabilities 
of the mutant and AF508 fusion proteins. As shown in Fig. 6 



Fig. 3. The nucleotide binding 
function of MBP-NBFl fusion pro- 
teins. The enhanced extrinsic fluores- 
cence of TNP-ATP upon binding to the 
fusion proteins was observed at 550 nm 
emission wavelength after exciting the 
samples at 410 nm. When ATP was 
added to the above sample to compete 
off the bound TNP-ATP, the fluores- 
cence of the TNP-ATP rapidly de- 
creased. See "Methods" for details. 
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{A and B), both fusion proteins lose more than half their 
capacity to bind TNP-ATP at a concentration of 1.8 M urea, 
and at 3.6 M urea most of the binding capacity is lost in both 
cases. Thus, this test indicates that the stability of the nu- 
cleotide binding function of the wild type and AF508 mutant 
fusion proteins are similar. 

Formation of Highly Organized Macromolecular Species — 
The tendency of both the wild type and dF508 fusion proteins 
to form high molecular weight species is evident from the 
chromatograms of gel filtration experiments (Fig. 7, A and 
B), Significantly, both fusion proteins elute at the void volume 
of an HPLC gel filtration column, exhibiting a molecular 
exclusion limit > 2000 kDa as defined by blue dextran (Fig. 
7C). In contrast, the MBP, which has a molecular mass of 42 
kDa, elutes as one major and two minor peaks all well within 




Fig. 4. Circular dichroism spectra of the MBP-NBFl fusion 
proteins. The molar mean residue ellipticity of the wild type or the 
mutant protein as a function of wavelength is depicted. Fusion 
proteins (1.3 mM) both dissolved In 10 mM Tris-Cl, pH 7.6, were 
subjected to CD spectroscopic studies using a 2<mm path length 
cuvette as described under "Methods.** 



the inclusion volume of the column. 

Electron microscopy studies with negatively stained sam- 
ples indicate that both the wild type and AF508 mutant fusion 
proteins are present in the form of long, highly organized 
fibers exhibiting widths of about 20 nm, and that individual 
fibers tend to associate to form bundles having collective 
widths of 50-380 nm (Fig, SA ), As shown by confocal fluores- 
cence imaging (Fig. 8, B and C), TNP-ATP can bind through- 
out these organized fibrous networks, indicating that each 
fusion protein within the network has retained the capacity 
to bind nucleotide. The fiber bundles comprising fusion pro- 
teins to which TNP-ATP is bound are about 1500 nm in 
width. 

Formation of Microcrystals — When fusion proteins in the 
presence of TNP-ATP were subjected to conditions com- 
monly used to crystallize proteins (see "Methods'*), the for- 
mation of microcrystals occurred in the presence of lithium 
sulfate and polyethylene glycol (Fig. 8I». Crystals (<0.005 
mm) were apparent usually after 3 weeks and have the light 
orange color characteristic of the TNP-ATP molecule. Micro- 
crystals were not formed under identical conditions in the 
absence of the fusion proteins. 

Proteolytic Cleavage of the MBP-NBFl Fusion Protein Fol- 
lowed by Chromatography on an Amylose Column — The fusion 
junction between the MBP and NBFl contains a recognition 
site for the protease involved in blood clotting caUed Factor 
Xa. Attempts to cleave this site using a variety of conditions 
in which ionic strength, various detergents, solvents, pH, and 
temperature were tested proved unsuccessful, suggesting a 
strong interaction between the MBP and NBFl within the 
fusion protein that masked access of the protease to the 
cleavage site. For this reason, a low concentration of SDS was 
employed in the proteolytic incubation medium, and as shown 
in the subsequent SDS-PAGE profiles (Fig. 9A). NBFl is 
now almost completely cleaved. However, if the cleaved fusion 
protein is first subjected to chromatography on an amylose 
column, the NBFl remains associated with the MBP and 
coelutes with it upon addition of the maltose containing buffer 



Fig. 5. Susceptibility of MBP- 
NBFl fusion proteins to proteases. 

The digestion patterns of fusion proteins 
analyzed by SDS-PAGE are depicted. A, 
trypsin; fl, trypsin in the presence of 42 
mM maltose plus 12.5 mM ATP; C, en- 
doproteinase Glu-C; D\ endoproteinase 
Arg-C. Times of digestion are indicated. 
See "Methods" for details. Molecular 
size markers are identical to those noted 
in the legend to Fig. 2. 
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Fig. 6. Effect of urea on the nucleotide binding function of 
fusion proteins. The wavelengths for excitation and emission of 
TNP-ATP are the same as in Fig. 3. Emission fluorescence of TNP- 
ATP bound to the wild type (A) and the mutant (fl) at varying 
concentrations of urea is shown. 
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Fig. 7. Gel filtration HPLC on the MBP-NBFl fusion pro- 
teins. HPLC chromatography was carried out exactly as described 
under "Methods.** Retention times for wild type {A ), mutant (B), and 
blue dextran (C) are 11 roin as depicted in the elution profiles of gel 
filtration chromatogams. MBP (£>) eluted as a major peak with a 
retention time of 16.5 min with two minor peaks with retention times, 
respectively, of 14.0 and 18.5 min. 



(Fig. 9B). The identity of the cleaved NBFl was confirmed 
by sequencing the N terminus of the protein band designated 
with the lower arrow in Fig. 9B. (It should be noted in Fig. 9 
{panel A, lane 3, and panel B, lanes 3 and 4) that Factor Xa 
causes some nonspecific cleavage of the maltose-binding pro- 
tein and the NBFl as indicated by several additional bands.) 

These results indicate that within the MBP-NBFl fusion 
complex a tight association occurs between the two fusion 
partners that is retained even after cleavage of the fusion 
junction. 

DISCUSSION 

The novel experiments described here are part of a longer 
term effort directed at understanding the molecular and chem- 
ical basis of cystic fibrosis. In the vast majority of cases of 
the disease (--92%), the problem distills down to first eluci- 
dating those structural perturbations in the CFTR protein 
resulting from deletion of phenylalanine 508 within the first 
nucleotide binding fold (NBFl), and then elucidating how 
these perturbations impair ATP-dependent chloride channel 
activity. Previous experiments in this laboratory conducted 
with long synthetic peptides have demonstrated directly that 
NBFl binds ATP (19). In addition, they indicate that Phe- 
508 lies within a /3 sheet region, that its deletion may result 
initially in a register shift to restore the sheet (17), but that 
the resultant structure is less stable (20), and, depenfUng on 
conditions, may unfold impairing ATP binding essential for 
normal chloride channel activity (20, 21, 47). Significantly, 
this simple working model is supported by recent studies in 
eukaryotic cells demonstrating that at low temperature some 
CFTR can reach the plasma membrane in functional form, 
albeit reduced, whereas at higher temperature {e.g. 37 "C) this 
apparently does not occur (22, 23). 

Operating within the framework of the above model, the 
objective of this study was to obtain in large amounts purified 
preparations of wild type and AF508 mutant NBFl domains 
in which structural changes are minimal. Assuming that the 
initial change resulting from the Phe-508 deletion is a register 
shift within a region of secondary structure* the initial per- 
turbation may be quite subtle* detectable only by sophisticated 
structural approaches, Le. x-ray crystallography or NMR 
spectroscopy. To this end, we were able to successfully fuse 
both the wild type and AF508 NBFl domains to the maltose - 
binding protein, a protein for which a crystal structure is 
known at high resolution. Moreover, these fusion proteins 
could be overexpressed in large amounts and rapidly purified 
in a single step (Fig. 2). 

The resultant wild type and AF508 mutant fusion proteins 
were shown to be virtually indistinguishable in their second- 
ary structural properties as assessed by circular dichroism 
spectrosopy (Fig. 4), a finding supported by experiments 
showing that both fusion proteins exhibit similar proteolytic 
degradation profiles (Fig. 5). The finding that the resultant 
fusion proteins both bind amylose and TNP- ATP (Figs. 2 and 
3) demonstrates that within the fusion complex both the MBP 
and NBFl have folded correctly to allow formation of their 
respective substrate binding sites. Finally, the additional ob- 
servation that the loss of TNP-ATP binding capacity as a 
function of urea concentration is similar (Fig. 6) indicates 
that within the fusion proteins no major differences in stabil- 
ity exist between wild type and AF508 mutant NBFl domains 
under the conditions tested. As the maltose -binding protein 
remains tightly associated with NBFl even after cleavage of 
the MBP-NBFl fusion junction (Fig. 9), direct protein-pro- 
tein interactions between the two fusion partners may be 
responsible for minimizing major differences between wild 
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Fig. 8. Electron micro^aphs. confocal fluorescence images, and microcrystals of MBP-NBFl fusion proteins. A, electron 
micrograph of wild type MBP-NBFl fusion protein negatively stained with uranyl acetate (see "Methods") depict their fibrous structures. 
The calibration bar shown on A represents 100 nm in length; magnification, x 98,000, (Similar micrographs were obtained also for the AF508, 
MBP-NBFl fusion protein.) B, confocal fluorescence microscopy was carried out exactly as described under "Methods," after which colors of 
the fibers were arbitrarily assigned. Emission fluorescence of TNP-ATP bound to the wild type {blue) and the mutant (red) at 520 nm are 
represented as two-dimensional images. Both are visualized in the form of fibrous structures. The length of the calibration bar represents 25 
nm. C, confocal fluorescence microscopy of the wild type fusion protein depicting stereo images (superimposed red/blue anaglyph) of fibrous 
structures in three dimensions. Z), microcrystals of the MBP-NBFl fusion protein. Crystals were obtained by using vapor diffusion employing 
the hanging drop method (see "Methods"). 



type and AF508 mutant NBFl domains. 

The additional finding reported here that the MBP-NBFl 
fusion proteins, and not the MBP alone, form highly orga- 
nized macromolecular species in aqueous media (Fig. 8), is 
fascinating and of considerable interest for several reasons. 



First, there have been reports (48, 49) indicating that the 
related multidrug- resistant P-glycoprotein (MDR) may exist 
as a dimer or higher ordered oligomeric species in the plasma 
membrane. The same may be the case for CFTR. Within such 
a complex, stability may be maintained in part by direct 
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Fig. 9. Factor Xa cleavage of the wild type MBP-NBFl 
fusion protein. A, the wild type (usion protein (MBP-NBFl) was 
treated with Factor Xa exactly as described under "Methods" and 
then subjected to SDS-PAGE. In all cases, 5 fig of protein was loaded 
onto the gel. Lane 7, untreated MBP control; hne 2, untreated MBP- 
NBFl fusion protein control; Ume 3, MBP-NBFl treated with Factor 
Xa. B, separation of NBFl from the Factor Xa cleavage mixture was 
attempted by binding the MBP component to an amylose resin. To 
do this the product from the cleavage reaction was mixed with the 
resin^ incubated at room temperature for 10 min, and then cent riiuged 
at 4000 X g for 10 min. The resultant pellet was washed with 20 mM 
HEPES, pH 7.6, and centrifuged again for 10 min at 4000 x g. Then, 
the proteins bound to the resin were eluted with 20 mM maltose 
containing 20 mM HEPES, 100 mM NaCl, and 2 mM CaCls. pH 7.5. 
Both the bound and unbound fractions (5 ^g) were then analyzed by 
SDS-PAGE. Lane i, untreated MBP control; lane 2, untreated MBP- 
NBFl fusion protein control; lane 3; the amylose unbound fraction; 
lane 4, the amylose bound fraction. In both A and B, the upper arrow 
indicates the position of the MBP and the lower arrow the position 
of the cleaved NBFl. The identity of the NBFl was confirmed by N- 
terminal sequencing of the band. Molecular size markers are the same 
as those indicated in the legend to Fig. 2. (See text for explanation of 
multiple bands in A, lane 3, and lanes 3 and 4.) 



interactions among NBFl domains of separate MDR or 
CFTR molecules. Removal of NBFl domains alone from such 
an environment may allow their normal propensity to interact 
to go "unchecked" with polymer formation resulting. Second, 
the overexpressed NBFl domain of CFTR (50, 51) and the 
nucleotide domain of the related bacterial HisP protein (52) 
have been reported to interact with the lipid bilayer of the 
membrane. This amphipathy of the domain must be compen- 
sated when placed in an aqueous buffer. Thus, polymer for- 
mation and, hence, stability could be promoted by interaction 
of the nonpolar regions of individual NBFl molecules. Third, 
regardless of why such organized macromolecular assemblies 
of MBP-NBFl fusion proteins form, the fact that they do 
form in aqueous buffer may prove useful in attempts to form 
crystals for structural analysis. Certainly, microcrystals con- 
taining TNP-ATP are readily formed in the presence of 
lithium sulfate and polyethylene glycol (Fig. 8), and efforts 
are currently under way in an attempt to grow large crystals 
suitable for x-ray diffraction analysis. 

Finally, it should be noted that the studies reported here 
may have direct relevance to a new type of gene therapy that 
might be attempted in future studies to ameliorate the severity 
of cystic fibrosis caused by AF508 mutations. Thus, the intro- 
duction into airway epithelial cells of a cDNA encoding a 
small stable protein designed to target and interact specifi- 
cally with the i^F508,N6Fl domain may stabilize defective 
CFTR molecules restoring their normal function. 
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Multidrug'resistant tumor cells overexpress P-glyco- 
protein (170 kDa), a member of the ABC (ATP Binding 
Ca88ette)-transporter superfamily. P-glycoprotein has 
been implicated in transport of a broad range of am- 
phiphilic, hydrophobic drugs from tumor cells. The se- 
quence and structural organization of P-glycoprotein, 
which consists of 12 transmembrane helices and two 
cytoplasmic nucleotide binding domains, is similar to 
other ABC-transporters. It is believed that the nucleo- 
tide binding domains of various ABC transporters* 
which have 30-^% sequence identity, play an impor- 
tant role in coupling ATP hydrolysis to the transport 
process. To allow structure-function studies of the nu- 
cleotide binding domains, the carboxyl-terminal nucle- 
otide binding domain (NBD) of Chinese hamster P-gly- 
coprotein has been cloned, overexpressed, and purified 
both by itself and as a fusion with maltose-binding pro- 
tein. It has been demonstrated that the carboxyl-termi- 
nal NBD, when overexpressed in Escherichia colt in the 
absence of transmembrane helices, has very low ATp£ise 
activity. This suggests that the amino-terminal nucleo- 
tide binding domain and possibly interaction with the 
transmembrane domains may be required for full 
ATPase activity. It is also consistent with the idea that 
the ATPase activity of P-glycoprotein is stimulated in 
the presence of drugs. Circular dichroism spectral anal- 
ysis and the abUity of carboxyl-terminal NBD, both by 
itself and as a fusion with maltose-binding protein, to 
bind ATP*agarose beads and P-glycoprotein specific 
monoclonal antibodies suggests that the polypeptide 
folds into a functional domain. Gel filtration chromatog- 
raphy €md cross-linking studies indicate that the car- 
boxyl-terminal NBD has a tendency to self-associate to 
form oligomers. It is specmlated that the carboxyl-termi- 
nal NBD may play a role in self-association of P-glyco- 
protein molecules in the plasma membrane. 



Multidrug resistance, in which cancer cells manifest resist- 
ance not only to chemotherapeutic drugs but also to other 
structurally unrelated drugs, is a major clinical problem in 
cancer treatment (for reviews, see Refs. 1 and 2). This phenom- 
enon is associated with the overexpression of the 170-kDa 
membrane-associated P-glycoprotein (Pgp).^ This protein con- 
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sists of two halves each containing six predicted hydrophobic 
membrane spanning segments followed by a cytoplasmic do- 
main (3). The sequence and domain organization of Pgp is 
typical of the ABC (ATP binding cassette) superfamily of active 
transporters and is presumably the result of an early gene 
duplication (4). The two cytoplasmic domains of Pgp, which 
include the familiar Walker A and B motifs for nucleotide 
binding (5), show extensive sequence identity with the NBDs of 
other ABC-transporters (6, 7). Pgp functions as an energy-de- 
pendent drug efflux pump and thus plays a major role in the 
reduced accumulation of drugs inside the cell (8). Plasma mem- 
brane vesicles prepared from multidrug-resistant cells and recon- 
stitution of purified Pgp in vesicles confirms that the drug trans- 
port is dependent on the constant supply of energy either from 
ATP or GTP (9-12). Pgp has also been associated with chloride 
(13) and ATP diannel activities (14), and has peptide transport 
ability (1). ATP has been implicated in all the functions of Pgp. 

Several residues have been mutated in the Walker A motif of 
both NBDs of Pgp with varied effects. Some mutations resulted 
in loss of transport activity but had no effect on binding to ATP 
analogues, whereas other mutations effected both transport and 
ATP binding (15). Also, a double mutation which alters both 
NBDs simultaneously shows reduced ATP channel activity (14). 
However, the mutations which alter the ATP hydrolyzing prop- 
erties of Pgp have no effect on chloride channel activity. This 
suggests that unlike drug transport, the chloride channel func- 
tion of Pgp does not require energy from ATP hydrolysis, al- 
though ATP binding is required for channel activation (13). 

Several monoclonal antibodies recognizing either the extra- 
cellular or cytoplasmic domains of Pgp have been selected (1). 
Monoclonal antibody C219 is specific for a continuous short 
sequence present in both NBDs. Another monoclonal antibody, 
C494, recognizes only the carboxyl-terminal NBD. The binding 
of either C219 or C494 to Pgp inhibits the binding of ATP (16). 
Another antibody, JC66, which is specific for the distal carbox- 
yl-terminal half of Pgp, completely inhibits the ATP channel 
activity of Pgp (14). The fact that C219 and C494 are equally 
effective in inhibiting ATP binding to Pgp and the inhibition of 
ATP channel activity by JC66 suggests that there is cooperat- 
ivity between the NBDs. 

The role of the NBDs in Pgp function, the structural changes 
upon ATP binding, and the coupling of ATP hydrolysis to the 
transport process remain poorly understood. This is primarily 
due to the complex structural organization and low abundance 
of the ABC-transporters (17, 18). Towards detailed structure- 
function study of the NBDs, we have cloned, expressed, puri- 
fied, and characterized the carboxyl-terminal NBD of Chinese 
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Fig. 1. Amplification of tlie carboxyl-tenninal NBD. the Pgp 

is schematically represented with the transmembrane domains boxed 
and numbered 1-12 and two NBDs represented by A and B to denote 
Walker sequences. Two clones CTM-NBD and ONBD were amplified. 
B, an agarose gel (1.0%) showing the amplified fragments. Lane 1\ 100 
base pair ladder; Lane 2, CTM-NBD; Lane 3, C-OTD. C, cloning of 
CTM-NBD in pT7-7. Various clones were picked, plasmid DNA was 
isolated. DNA was digested with restriction enzymes Ndel and BomHl 
to confirm the presence of the insert. Lanes 1, 3, and 4 represent various 
clones screened for Uie CTM>NBD insert, Z), restriction map of C-NBD. 
E, restriction digest analysis of C-NBD. Lane i, markers (1057, 770, 
612, 564, 495. 393. 345, and 341 bp); Lane 2, BamHI; Lane 5. Pstl (1800 
bp fragment); Lane 4, Ncol (530- and 330>bp fragments); Lane 5, Xhol 



hamster Pgp. Three dififerent constructs were made in three 
different expression systems in an effort to optimize the yield 
and solubility of the purified protein. Expression as a fusion 
with maltose-binding protein (C-MBP) resulted in good protein 
yield, with the advantage that the known MBP structure (19) 
can aid in crystallographic structiire determination of the car- 
boxyl-terminal NBD of Pgp. Circular dichroism, and the ability 
of the expressed domains to bind ATP and antibodies, sug- 
gested that they retain a native-like fold. The fusion protein, 
C-MBP, retains relatively weak ATPase activity in the absence 
of transmembrane domains and the other NBD. The tendency 
of the carboxyl-terminal NBD to form oligomers suggests it 
may play an importsmt role in self-association of Pgp in the 
plasma membrane. 

EXPERIMENTAL PROCEDURES 
Materials 

A cDNA done for Chinese hamster ovary Pgp was obtained from Dr, 
Victor Ling, Ontario Cancer Institute. Primers used for the polymerase 
chain reaction (PGR) were sjmthesized in the Ontario Cancer Institute 
Biotechnology Laboratory using an Applied Biosystems 392 DNA/RNA 
synthesizer. The pMal-c2 expression vector, amylose affinity resin, 
restriction enzymes. Vent DNA polymerase, and T4 DNA ligase were 
purchased from New England Biolabs. The pT7-7 expression vector 
was kindly given to us by Dr. Bruce Waygood, University of Saskatch- 
ewan- The pET-22b(+) expression vector was obtained from Novagen, 
Ni-NTA-agarose and DNA purification kit were obtained from Qiagen 
Inc. ATP-agarose beads were purchased from Sigma. The DNA sequenc- 
ing kit (Sequenase version 2.0 DNA) was from U. S. Biochemical Corp. 
A Pharmacia FPLC system was used for gel filtration. 

Methods 

Construction of the Expression Vector Containing Pgp Carboxyl-ter- 
minal iVBD-— Two different clones of the carboxyl-terminal NBD were 
amplified by PCR (Fig. lA). CTM-NBD, which includes the last trans- 
membrane (TM) helix, TM12 (M963) to the end of the transcript was 
cloned into the expression vector pT7-7 under the control of phage T7 
RNA polymerase promoter (Fig. 2). The restriction enzyme sites, iVdel 
in the forward and SftaBI in the reverse PCR primer shown below, were 
incorporated to facilitate cloning into vector pT7-7 (20). Another con- 
struct, C-NBD, including amino acids Gln-984 to Arg-1276 but lacking 
TM12, was also amplified by PCR using the primers shown below with 
BamHI and EcolXi restriction enzyme sites in the forward and reverse 
primer, respectively. C-NBD was cloned into pET-22b(+), also under 
the control of strong phage T7 transcription and translation signals 
(21). This expression vector contains a pelB leader sequence for the 
export of the target protein into the periplasm for improved folding and 
easier purification (22). Also at the COOH terminus of the protein there 
are six consecutive histidine residues. This hexahistidine tag facilitates 
purification through the high affinity of histidine for Ni-NTA resin 
(Qiagen Inc.), PCR-Mfel: 5 -(KK?AA£ATAT6ACATTTG AAAATG TTC- 
TATTAGTAT.3'; PCR-SnaBI: S'-GCKMJT^SfiTACTTTCTTTTTAACJT- 
TTGGTTAAATG03'; PCRpBomHI: S'-GGGCGQGASIgfi GCAGG TCA- 
GTTCATTTGCT.3'; PCR-JScoRI: S'-GCSGGGfiMEEGGCGCTTTGCTC- 
CAGCCTGC-3'. 

The polirmerase chain reaction was carried out in a 100-ftl reaction 
volume containing 20 mM Tris-HCl, pH 8.8, 10 mM KCI, 10 mM 
(NH4)2S04, 2 mM MgS04, 0.1% Triton X-lOO, 2.5 units of Vent DNA 
polymerase, 100 pmol of each primer, and 50 ng of template DNA which 
had been purified using QIAprep-spin plasmid purifications (Qiagen 
Inc.). The PCR reaction was carried out for 30 cycles and each cycle was 
performed as follows: denaturation at 92 ''C for 30 s, annealing at 60 ^'C 
for 30 8, and extension at 72 *»C for 30 s. The amplified PCR producU, 
CTM-NBD (1000 bp) and C-NBD (880 bp), were purified using the 
QIAquick-spin PCR purification kit (Qiagen Inc.) and subsequently 
digested with appropriate restriction enzymes. The digested product 
was then subjected to agarose gel electrophoresis and the DNA was 
extracted using the QIAEX DNA gel extraction protocol (Qiagen Inc.). 

The purified PCR product for CTM-NBD was ligated into the NdeV 
Smal restriction sites of expression plasmid pT7-7 and C-NBD was 
ligated into the BamHlfEcdKi sites of plasmid pET-22b(+). Competent 



(624 bp fi^gment); Lane 6, BamKl-Hindni (880-bp fragment); Lane 7, 
same as lane 1, 
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Fig. 2. The schematic representa- 
tion of cloning of <»rbozyI*terminaI 
NBD in the expression vector. The 

cloning into the expression vector pT7— 7, 
pET22b( + ), and subcloning into pMAL,-c2 
is diagrammatically represented. The am- 
phfied DNA firs^ent of CTM-NBD is U- 
gated into NdeVSmal site of pT7-7 ex- 
pression vector under the control of T7 
promoter. The StuVHindill fragment 
from pT7-7 vector containing CTM-NBD 
fragment was ligated into EcoBl site 
which was blunt ended with Klenow frag- 
ment of DNA polymerase and the i/indlll 
site in the multiple cloning site of pMal-c2 
under the control of tac promoter. The 
insertion of the DNA fragment results in 
the disruptions of /acZa gene. When the 
plaismid containing the desired DNA frag- 
ment is transformed in laca, complement- 
ing host white colonies are formed instead 
of blue in the presence of 5-bromo-4- 
chloro-3-indoyl /3-ivgalactoside and IFTG. 
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BL21(DE3)pLysS cells were transformed with the ligation mixture and 
clones were screened using PCR (23). The procedure is briefly described 
as follows: a single colony that is 1 mm in diameter was picked with a 
sterile pipette tip. Before transferring the colony to an Eppendorf tube 
containing 50 ^1 of sterile water, the pipette was touched to a 
YT+ampicillin (100 /tg/ml) plate for later use. The tubes were placed in 
boiling water for 5 min and centrifuged for 1 min. A IO-/1I aliquot of the 
supernatant was used for the PCR reaction using T7-promoter and 
terminator primer as described above. 

To express the NBD as a fusion with maltose-binding protein, a 
Stul-Hindlll fragment (8(X) bp), including amino acid Leu-1023 to the 
end of the transcript, was subcloned fr*om pT7-7 CTM-NBD into an 
EcoBl restriction enzyme site filled with the Klenow fragment of DNA 
polymerase and the Hindlll restriction site of expression plasmid 
pMal-c2 (24) (Pig. 2). The insertion of the gene into the vector inacti- 
vates the 0-gBlactosidase a-fragment activity of the malE4acZa fusion 
and results in a blue to white color change on 5-bromo-4-chloro-3-indoyl 
^-D-gaiactoside and isopropyl-l-thio-/i-D-galactopyranoside (IPTG) 
plates when the construct is transformed into an a-complementing host 
such as JMlOl or NM522. The ligation mixture was transformed into 
competent NM522 Escherichia coli cells and grown on YT-agar plates 
containing £unpicillin, 5-bromo-4-chloro-3-indoyl 0-D-galactoside, and 
IPTG. The white colonies were screened by restriction digest analysis 
and subsequently sequenced using the Sanger dideoxy method (25) to 
confirm the fidelity of the PCR step. 

Overexpression and Purification of CTM-NBD, C-NBD, and 
C'MBP — ^BL21(DE3)pLy8S cells containing the appropriate construct, 
either CTM-NBD or C-NBD, were grown in 2 x YT broth containing 100 



ME^ml ampicillin at 30 ''C until was 0.7. The culture was then 
induced with 1 mM IPTG and harvested 3 h later at 5000 X ^ for 20 min. 
After washing with 0.85% (w/v) NaCl, cells were resuspended in lysis 
buffer consisting of 20 mM Tris-HCl, pH 8.0. 10% glycerol, 500 mM NaCl, 
1 mM phenylmethylsulfonyl fluoride, 10 mM ^-mercaptoethanol, 0.1% 
Nonidet P-40, 10 tA^/^^ pepstatin, 10 Atg/ml leupeptin, and 1% (vA^) 
aprotinin. The cells were frozen and kept at —20 °C until required. The 
frozen cells were thawed in a warm water bath and sonicated on ice for 
2x5 min at a power setting 10 and 10% duty cycle with a 5-min 
interval using a Branson Model 450 sonifier. The lysed cells were 
centrifuged at 20,000 X ^ in a 45 Ti rotor (Beckmann) for 20 min. For 
C-NBD, supernatant was loaded onto a Ni-NTA-agarose column equil* 
ibrated with lysis buffer without protease inhibitors. After extensive 
washing, the protein was eluted with lysis buffer plus 200 mM imidaz- 
ole. The fractions containing protein were detected by SDS-PAGE (26), 
pooled, and concentrated. 

For CTM-NBD, the crude extract was loaded onto a Q-Sepharose 
(Pharmacia) column equilibrated with lysis buffer and the protein was 
eluted with a 0-1 m NaCl gradient The fractions containing protein were 
concentrated and loaded onto a S-200HR (Pharmacia) gel filtration col- 
umn also equilibrated with lysis biiffer. After SDS-PAGC anal3^, the 
finactions containing the protein of interest were pooled and concentrated. 

For C-MBP, cells were grown, induced, harvested, and lysed in a 
similar fashion to those for CTM-NBD or C-NBD. The crude extract was 
loaded onto an amylose-afRnity column equihbrated with lysis buffer. 
The column was extensively washed and the protein was eluted with 
the same biiffer containing 10 mM maltose (27). Fractions were ana- 
lyzed by SDS-PAGE, pooled, and dialyzed against 10 mM Tris-HCl, pH 
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Pig. 3. The SDS-PAGE gels for the 
analysis of overexpressed and puri- 
fied protein. BL21(DEd)pLysS cells con- 
taining either CTM-NBD or C-NBD were 
grown and induced as described under 
"Methods." A, the expression and the local- 
ization of CTM-NBD was checked in the 
various fractions (23). The expression of 
CTM-NBD appears to be low as shown in 
the total cell protein of induced cells and 
seems to be distributed equally in the sol- 
uble and insoluble fraction. There was no 
protein detected in the periplasmic frac- 
tion. B, the purified proteins, CTM-NBD 
(34 kDa) and C-NBD (34 kDa) are shown. 
C, the expression of C-MBP fusion protein. 
The total cell protein of uninduced and in- 
duced cells is compared. A protein band 
corresponding to -HSS kDa is induct in the 
presence of 1 mM IPTG, which correspond 
to C-MBP. The expression of MBP was 
used as a control and gel shows total cell 
protein before and after induction. D, the 
cleavage by specific protease Factor Xa. 
Factor Xa (200 Mg^ml) was added to C-MBP 
(1 mg/nnl) in 50 niM Tris-HCl bufter. pH.8.0, 
containing 500 mM NaCl and the reaction 
was carried at room temperature for vari- 
ous lengths of time. Lanes 2 and 2 repre- 
sent C-MBP before and after 24 h of diges- 
tion with Factor Xa. 




Table I 

Amino acid analysis was carried out cls described under "Methods" on fusion protein C-MBP 
The liiolar ratios are based on 93 mol of His/mol of fusion protein. The value for tryptophan is not reported because it gets degraded in presence 
of 6 M HCl and is not detectable. The low value of cysteine is also attributed to Uie degradation during hydrolysis. 

Amino add Amount of amino ^ffJS^GCo/"® Experimental value Bxperimentalftheoretica] Molar ratio 



nmol 



Asx° 


7716.245 


82 


94 


1.14 


84.1 


Glx* 


6501.670 


74 


79 


1.06 


70.8 


Ser 


2286.068 


30 


28 


0.9 


24.9 


Gly 


5151.250 


50 


62 


1.26 


56.1 


His 


826.236 


9 


10 


1,1 


9.0 


Arg 


1876.950 


19 


22 


1.2 


20.4 


Thr 


2745.462 


32 


33 


1.06 


29.9 


Ala 


6270.499 


64 


76 


1.2 


68.3 


Pro 


3183.053 


29 


39 


1.3 


34.6 


Tyr 


1857.716 


19 


22 


1.15 


20.2 


Val 


3888.509 


44 


47 


1.06 


42.3 


Met 


879.015 


9 


10 


1.1 


9.6 


Cys 


60.900 


3 


1 


0.3 


0.7 


He 


3199.020 


41 


39 


0.95 


34.8 


Leu 


5209.918 


67 


63 


1.1 


56.7 


Phe 


1772,368 


23 


21 


0.9 


19.3 


Lys 


3394.617 


51 


42 


0.82 


36.9 



" Asx; asparUc add 4- asparagine. 
* Glx; glutamic acid + glutamine. 



8.0. The sample was then loaded onto a Q-Sepharose column equili- concentrated, and the protein was exchanged into water using an Ami- 
brated with the same buffer. Protein was eluted with a 0-1 M salt con ultrafiltration cell, with a YMIO membrane (Amicon Inc.). 
gradient in 10 mM Tris-HCl, pH 8.0. Appropriate fractions were pooled, Amino Acid Analysis of C-MBP— For amino add analysis, 1000 pmol 
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Fig. 4. Detection of antibody binding by Western blot. Western 
blot was performed as described under "Methods." The oligomeric forms 
corresponding to dimeric and trimeric forms of the cart>o}0^1*terminal 
NBD can be detected by monoclonal antibody. A, detection of mono- 
dona] antibody C219 binding to proteins expressed in E. colL Lane 1, 
purified CTM-NBD (50 ^ig); Lane 2, purified C-NBD (50 fig); Lane 5. 
purified C-MBP (20 /Jtg). B, binding of monoclonal antibody C494 to 
carboxyl-terminal NBD. Lane I, purified CTM-NBD (20 fjLg); Lane 2, 
purified C-NBD (10 /ig). C, detection of C-MBP (50 /ig) with C494. 

of C-MBP was vapor-phase hydrolyzed in 6 N HCl for 48 h. The resulting 
amino acids were derivatized with phenyl isothioc3^ate and separated 
on a PICO-TAG (Pharmacia) column (3.8 mm x 15 cm) using reverse 
phase chromatography. The results were analyzed using. PICO-TAG 
amino acid analysis software. 

Western Blot Analysis — ^Western blotting was performed essentially 
according to Towbin et al, (28). Briefly, either crude extract or 20-100 /ig 
of purified CJTM-NBD, C-NBD, or C-MBP was mixed with an equal 
volume of gel loading buffer which lacked /3-mercaptoethanol and electro- 
phoresed on 10% SDS-PAGE, The protein was transferred to a polyvinyli- 
dene difluoride membrane using a Bio-Rad electroblotting apparatus. 
Polyvinylidene difluoride membranes were blocked for 2 h with 10% (wA^) 
non-fat dry milk in TBS buffer (0. 1 M Tris-HCl, pH 7.2, 9% (w/v) NaCl) and 
then incubated for 1 h with 0.5 M^knl monoclonal antibody C219 or C494 
in the same buffer. The blot was thoroughly washed with TBS containing 
1% (wAr) non-fat dry milk and 0.1% (v/v) Tween 20 and then incubated for 
1 h with 1:2500 dilution of anti-mouse horseradish peroxidase-oox^ugated 
IgG in TBS buffer. After several washes with TBS buffer, the signal was 
detected by enhanced chemiluminescence (Amersham). The blot was ex« 
posed to Kodak X-Omat film for 15 s to 20 min. 

ATP-affinity Column Binding — ATP-agarose beads (20 /tl) were 
washed thoroughly with 10 mM Tris-HCl buffer, pH 8.0. Partially pu- 
rified C-NBD or C-MBP was added to the beads and then incubated on 
a shaker for 1 h at 4^C. Agarose beads were washed again with 
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Fig. 5. ATP binding characteristics of the expressed domains. 
Partially purified proteins, C-MBP and C-NBD were incubated with 
agarose beads as described under "Methods." A: Lane i, C-MBP (par- 
tially purified); Lane 2, C-MBP bound to the beads; Lane 3, wash; Lane 
4y C-MBP (purified); Lane 5, C-MBP bound to the beads; Lane 6, 
molecular weight markers; Lane 7, C-NBD (partially purified); Lane S, 
C-NBD bound to the beads; Lane 9, wash. B, binding competition with 
ATP. ATP (1-8 mM) was added to ATP-agarose beads prior to the 
incubation with C-MBP. Lanes 2, 4, 6, and 8 represent C-MBP bound to 
the beads in the presence of 0, 1, 2, and 3 mM ATP. Lanes 3, 5, 7, and 9 
represent unbound protein. 

Tris-HCl buffer and then boiled in 30 fx\ of sample buffer to release 
protein bound to the beads. The samples were electrophoresed on 12% 
SDS-PAGE. ATP (1-3 mM) was used to compete the binding of protein 
to the agarose beads. 

Circulcw Dichroism Spectroscopy (CD) — ^The fusion protein C-MBP 
(10 tm) was exchanged into water using an Amicon ultrafiltration cell 
with a YM30 membrane (Amicon Inc.). The CD spectrum for C-MBP 
was recorded between 200 and 250 nm at 23 **C in a 1-mm path cuvette 
using a Jasco J720 spectropolarimeter. 

Gel Filtration FPZ,C— C-MBP (1 mg/ml) in 10 mhi Tris-HCl, pH 8.0, 
was loaded onto a Pharmacia Superose 12 and Superose 6 FPLC gel 
filtration column equilibrated with the same buffer. The elution profiles 
of the OMBP and standards, which consisted of blue dextran (200 kDa), 
bovine serum albumin (67 kDa), ovalbumin (43 kDa), and chymo- 
trypsinogen A (25 kDa) were monitored at 280 nm. The ^ution volume 
was calculated and plotted versus log,o molecular weight. Gel filtration 
chromatography was also performed in the presence of 0. 1% CHAPS in 
10 mM Tris-HCl. pH 8.0. 

Chemical Cross-linking — Chemical cross-linking of C-MBP was per- 
formed with dithiobis(8uccimmidyl propionate) (DSP) and 3,3'-dithio- 
bis(sulfosuccinimidyl propionate) (DTSSP) (29). DSP was dissolved in 
iV,iV-dime1iiylformamide to give a final concentration of 2.5 mM and 
DTSSP was dissolved in 20 mM sodium phosphate buffer, pH 8.0, to a 
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Fig. 6. Circular cUchroism spectra 
of C-MBP. The fusion protein C-MBP (10 
and MBP (10 yM) was exchanged in 
water and the CD spectra was recorded as 
described under "Methods." The plot of 
molar ellipticity (0) versus wavelength 
(nm) is depicted. A, spectra of MBP is 
shown with dashed line and C-MBP with 
solid line. B, difference spectrum which 
represents spectrum of NBD was ob- 
tained by subtracting spectra of C-MBP 
from MBP. 
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concentration of 0.116 mM. C-MBP (0.029 niM in 20 mM sodium phosphate, 
pH 8.0) was cross-linked with 0.0135-1.0 mM DSP and 0.0145-0.116 mM 
OTSSP in 20 mM sodium phosphate buffer. After 20 min at room temper- 
ature, the reactions were terminated by adding 60 mM glycine. Hie sam- 
ples were loaded on 10% SDS-PAGE after the addition of an equal volume 
of gel-loading buiTer which lacked ^-mercaptoethanol. 

Measurement of Mg^^ -ATPase Activity Using a Colorimetric Assay — 
Mg^"^-ATP€ise activity was determined by measuring the release of 
inorganic phosphate using a colorimetric assay (30). The assay solution 
contained 100 /ig of C-MBP in 360 fd of assay buffer (50 mM Tris-HCl, 
pH 7.6. 0.15 mM NH^Cl, 5 mM MgClg, 2 mM ouabain, 100 fiM EGTA, 2 
mM dithiothreitol, and 0.89 mM CHAPS). The reaction was initiated by 
adding 40 ptl of ATP to give a final concentration of 2 mM in a 400-fd 
final volume. After 1 h at 37 "C, the reaction was terminated by adding 
400 fi} of 6% SDS, 3% ascorbate, and 0.5 M sodium molybdate in 0.5 M 
HCl. Products were stabilized by adding 400 ^1 of 2% sodium citrate, 2% 
sodium meta-arsenite, and 2% acetic acid. After incubation at 37 °C for 
10 min the absorbance was read at 850 nm. 

RESULTS 

Overexpression and Purification of CTM-NBD, C-NBD, and 
Fusion Protein C-MBP— The amplified PGR products for CTM- 
NBD and C-NBD (Pig. IB) were cloned into pT7-7 (Fig. IC) 
and pET-22b( + ) expression vectors under the control of a 
strong bacteriophage T7 promoter and the Stul-Hindlll frag- 
ment from CTM-NBD was subcloned in-frame with maltose- 
binding protein in the pMal-c2 expression vector (Fig. 2). The 
correct orientation of the clones was verified by sequencing and 
restriction digest analysis (Fig. 1, D and E), The IPTG-induci- 
ble expression of CTM-NBD and C-NBD and the amoimt of 
protein in the soluble fraction were moderate (Fig. 3A). Also the 
protein repeatedly precipitated during puiiiication. The overall 
yield of purified protein was 25-50 ftg/liter of cell culture. 
However, when expressed as a fusion with MBP, there was a 
significant increase in expression level and most of the protein 
remained soluble (Fig. 3C). Using amy lose resin, for which 
MBP has great affinity, 80 mg of fusion protein was purified 
from a liter of cell culture. Purified (TTM-NBD, C-NBD, and the 
fusion protein C-MBP were >90% pure as judged by SDS- 
PAGE (Pig. 3, B and Z>). The number of amino acids determined 
by amino acid analysis of C-MBP agreed closely with the the- 
oretical value calculated using the program GCG (Genetics 
Computer Group, Wisconsin), thus confirming the identity and 



A (nm) 

purity of the fusion protein (Table I). 

Expression vector pMal-c2 contains the sequence coding for 
the recognition site of the speciiic protease Factor Xa, located 3' 
of MBP, allowing cleavage of MBP from the protein of interest. 
Cleavage with Factor Xa for C-MBP was veiy slow: after 24 h 
only 20% of the protein was cleaved, suggesting that the pro- 
tease site may have low accessibility (Fig. 317). 

Western Blot Analysis — The purified proteins were checked 
for their ability to bind to Pgp-specific monoclonal antibodies, 
C219 (Fig. 4A) and C494 (Fig, 4, B and C), by Western blot 
analysis. C219 and C494 recognize all three carboxyl-terminal 
NBD clones, i.e. CTM-NBD, C-NBD, and C-MBP, expressed in 
E. coli, MBP alone has no affinity for the antibodies (data not 
shown). Binding of C-MBP to C219 and C494 was also con- 
firmed by enzyme-linked inmiunoassay. These results further 
confirm that the purified protein is the NBD of Pgp and not any 
other protein from E, coli of the same molecular mass. 

ATP-agarose Binding Characteristics — CTM-NBD and C-MBP 
both bind ATP-agarose beads suggesting that the NBD retains 
its secondary and tertiary structure (Fig. 5A). In a control exper- 
iment, MBP alone did not bind the beads indicating that the 
interactions wth ATP are due to the NBD alone. The binding 
properties of C-MBP, C-NBD, and CTM-NBD were the same. All 
showed decreased binding to ATP-agarose beads in the presence 
of ATP (Fig. 5B). CTM-NBD and C-MBP bind to an Affi-Gel® 
Blue afi&nity gel column (Bio-Rad) (results not shown), which is 
specific for nudeotide-binding proteins. This fiirther indicates a 
correctly folded protein that retains ATP binding ability. 

Secondary Structure of C-MBP as Determined by CD— The 
ability of the carboxyl-terminal NBD to bind to ATP suggests 
that it retains a fiinctional tertiary structure. This was fiirther 
confirmed by CD spectral analysis. A CD spectrum was re- 
corded for C-MBP between 200 and 250 nm. The plot of mean 
residue molar ellipticity of C-MBP exhibits two minima at 209 
and 220 nm (Fig. 6). From the CD spectrum, C-MBP was 
calciilated to consist of 14% a-helix, 42% 0-strand, 20% tiu^i, 
and 23% irregular structure. This secondary structtu'e calcula- 
tion was carried out using Prosec software package. Thus, 
C-MBP is highly structured. 

Oligomer Formation — ^The tendenpy of the carboxyl-terminal 
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Fig. 7. Molecular weight by gel filtration FPLC. The elution 
volume of the standards which includes blue dextran, bovine serum 
albumin, ovalbumin, and chjmiotrypsinogen A was determined and 
plotted as a fimction of logio molecular weight. Elution volume of 
purified C-NfBP was determined under the same conditions. Molecular 
mass of C-MBP was estimated to be approximately 2000 and 68 kDa 
shown by arrows on the graph. 

NBD to form oligomers was evident from gel filtration and chem- 
ical cross-linking experiments. In the former, most C-MBP elutes 
at the void volxime (with the blue dextran) of an FPLC gel filtra- 
tion column, exhibiting a molecular mass of 2000 kDa. However, 
a smaD fraction of protein also elutes with bovine serum albumin 
which has about the same Af^ as C-MBP (Fig. 7). Performing gel 
filtration chromatography in the presence of 0.1% (vA^) CHAPS 
had no effect on the elution profile of the standards and C-MBP 
(results not shown). In a control gel filtration experiment involv- 
ing MBP alone, there was no evidence for oligomerization. The 
tendency of C-MBP to oligomerize was fijuther evidenced by 
using the homobifunctional, t^o-cleavable and amine-reactive 
chemical cross-linkers DSP and DTSSP. With such reagents, 
proteins in dose proximity have the potential to be chemically 
cross-linked. With a low concentration of DTSSP (0.0073, 0.0145, 
and 0.029 mM), bands which correspond to dimeric (136 kDa) and 
trimeric (204 kDa) forms of C-MBP are seen (Fig. 8). With DSP, 
such intermediate forms could not be detected and most protein 
was present as higher order oligomers. MBP £done did not form 
oligomers with either I>TSSP or DSP. Furthermore, Western blot 
analysis confirms that the Pgp-specific monoclonal antibodies are 
able to recognize the oligomeric forms of CTM-NBD, C-NBD, and 
C-MBP (Fig. 4). 

ATPase Activity of the Fusion Protein — C-MBP exhibits very 
low Mg^*-dependent ATPase activity with a V,^ of 23.8 nmol/ 
min/mg and a of 20 mM as determined by a Lineweaver- 
Burk plot (Fig. 9). The presence of ouabain (2 mM) and EGTA 
(100 $jm) has no effect on the activity, suggesting that the 
observed ATPase activity is not due to the presence of contam- 
inating Na^-K"**- or Ca^"*^-ATPase. ATPase activity is depend- 
ent on the concentration of ATP and Mg^*^ in the assay buffer. 
In the presence of 5 mM MgCla, ATPase activity decreased with 
increasing concentration of ATP. The observed decrease may be 



Fig. 8. Identification of intermediate oligomeric forms of C- 
MBP by chemical cross-linking with DSP and DTSSP. A series of 
cross-linking experiments were performed with purified 0.029 mM C- 
MBP as described under *^ethods.* Lanes 1-4 shows cross-linking with 
increasing concentration of DSP (0.0135, 0.135. 0.5. and 1.0 mM); lane 5 
is molecular weight markers; iMnes i-lO represent cross-linking with 
increasing concentration of DTSSP (0.0073, 0.015, 0.029, 0.058, and 
0.116 mM). 
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Fig. 9. Lineweaver-Burk plot of dependence of ATPase activ- 
ity of fusion protein C-MBP. Assay buffer contained 100 fig of C- 
MBP in 400 yX of 400 jd of 50 mM Tris-HCl. pH 7.6, 0.15 mM NIi,Cl, 60 
mM MgCls, 2 mM ouabain, 100 /mM EGTA, 2 mM dithioihreitol, 0.89 mM 
CHAPS, and various concentrations of ATP (1-10 mM). V„„ and 
was calculated from the plot of 1/V (1/nmol of ATP hydrolyzed per min 
per mg of C-MBP) versus 1/S (1/IATP) mM). V^^^ of 23.8 nmol/minMig 
and a of 20 mM for ATP was calctdated from the plot. 

due to a limiting concentration of Mg^"** in the assay buffer, 
since no decrease in activity with increasing ATP concentration 
was observed when the assay was performed in the presence of 
60 mM MgCl2. In the reverse experiment, in which ATP con- 
centration was kept constant, a similar dependence of ATPase 
activity on Mg^"*" concentration was observed (Fig. 10): in the 
presence of 2 mM ATP, ATPase activity decreased as Mg^"*" 
concentration increased. The observed decrease can be attrib- 
uted to a limiting concentration of ATP. The kinetic parameters 
of the carboxyl-terminal NBD, in the absence of the Pgp trans- 
membrane heUces, are different from those of intact Pgp. The 
ATPase activity of Chinese hamster Pgp decreases in the pres- 
ence of high concentrations of ATP with maximal activity at 6 
mM ATP (12). In some cases a higher Mg^*** concentration (50 
mM) caused inhibition of ATPase activity and in other cases 
there was no inhibition up to 50 mM Mg^'^ (12). 
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Fig. 10. Dependence of ATPase activity on concentrations of BIgCl, and ATP. Each assay sample contained 100 iMjg of C-MBP in 400 
of 60 mM Tris-HCl, pH 7.6, 0.16 mM NH«C1. 5 mM MgCl,, 2 mM ouabain. 100 yUA EGTA, 2 mM dithiothreitol, and 0.89 mM CHAPS or as indicated. 
A, ATP dependence in the presence of 6 mM MgCl^. B, ATP depentklence in the presence of 60 niM MgC^. C, Mg^^ dependence in the presence of 
3 mM ATP. 



DISCUSSION 

A central question in multidrug resistance is how a single 
integral membrane protein, Pgp, can transport a wide variety 
of drugs and hydrophobic peptides. The role of ATP and how 
ATP-derived energy is harnessed in the transport process of 
Pgp and other ABC-transporters remains unknown. It has 
been suggested that the transport of solutes by an ABC-trans- 
porter across the membrane is dependent on energy derived 
from ATP and that two molecules of ATP are hydrolyzed per 
transport event (31). The latter proposal is consistent with the 
fact that most ABC-transporters contain two NBDs. The he- 
molysin and glycine-betaine transporters, which have only one 
NBD, are known to function as dimers (17). The NBDs from 
several ABC-transporters, for example, cystic fibrosis trans- 
membrane conductance regulator (CFTR) and maltose (MalK), 
histidine (HisP), and oligopeptide (OppF) permeases, have 
been overexpressed and purified (32-36). The purified domains 



are able to bind ATP and its analogues but show no ATPase 
activity. It has also been suggested that these nucleotide-bind- 
ing domains may not be able to hydrolyze ATP in the absence 
of the membrane binding components of the transporters (31). 

In this study, we have chosen the carboxyl-terminal NBD of 
Pgp as a model system for structural and functional character- 
ization of ABC-transporter NBDs. Early reports suggest that, 
like any other ABC-transporter, Fgp requires both of its NBDs 
for efficient activity and also that the ATPase activity is stimu- 
lated in the presence of drugs (33). (Georges et a/. (1991) sug- 
gested that there is cooperativity between the two NBDs of Pgp, 
as evidenced by the study of mutants having deletions in the 
amino or carboxyl half of Pgp. Deletion mutations in either do- 
main resulted in the loss of transport function, indicating that 
both halves are necessary (37, 38). This contradicts the finding 
that the amino-terminal NBD alone contains all the residues 
necessary to hydrolyze ATP (39). These studies were carried out 
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with either intact Pgp or the isolated amino-terminal half includ- 
ing TMDs 1-6. To investigate the role of the carboxyl-terminal 
NBD of Pgp in transport, chlonde^ and ATP channel activities, 
we have overexpressed this domain in E. coli. 

When overexpressed, the CTM-NBD and C-NBD proteins 
were found in the membrane fraction, indicating that either the 
protein is very hydrophobic or is associated with the mem- 
brane. This is not surprising for CTM-NBD since it contains the 
amino acid residues from TM12. C-NBD, which lacks TM12 
residues, was expressed in pET22b(+). This expression plas- 
mid contains a pe/B leader sequence, which directs proteins to 
the periplasmic space. Since C-NBD also remained associated 
with the membrane, it might be specnilated that in intact Pgp, 
the NBDs somehow associate with the membrane. Similar ob- 
servations have been made with the NBDs of bacterial per- 
meases such as HisP, OppF, and MalK and with those of 
human CFTR. When expressed in the absence of TMDs, these 
NBDs remain closely associated with the membrane (32-36). 
Although the NBDs and the TMDs of bacterial ABC-transport- 
ers are encoded by different sets of genes, the proteins tightly 
associate with each other to form the intact transporter. It is 
interesting to speculate how NBDs which are highly hydro- 
philic might interact with the membrane. It was proposed that 
a segment of NBD protrudes into or through a pore generated 
by the TMDs (35). However, this remains to be verified by, for 
example, three-dimensional structural information. 

Expression of the carboxyl-terminal NBD as a fusion with 
MBP showed tremendous improvement in the protein yield and 
solubility. There are additional advantages to expressing pro- 
tein as a fusion with MBP: the affinity of MBP for amylose can 
be used for rapid purification and the three-dimensional struc- 
ture of MBP has been elucidated (19). This coidd yield infor- 
mation useful in the ultimate structure determination of the 
Pgp NBD. The inability to efficiently cleave MBP from the NBD 
with Factor Xa suggests that the protease site has low acces- 
sibility. A similar observation was made with the NBD of CFTR 
when expressed as a fusion with MBP. It was speculated that 
the CFTR NBD may interact strongly with MBP (33). 

CTM-NBD and C-MBP retain the ability to bind ATP. Unlike 
NBDs of other ABC-transporters, the cartKMQ^l-terminal NBD of 
Pgp retains low ATPase activity when expressed in the absence of 
the other NBD and TMDs. Previous work suggests that upon 
mutating one of the two NBDs, Pgp loses its activity, indicating 
that the two NBDs function in a cooperative manner (37). Interac- 
tions of the carboxyl-terminal NBD with either or both of the TMDs 
and the amino-terminal NBD of Pgp may be reqiiired for fidl 
ATPase activity. Inhibition of ATPase activity with a hi^ concen- 
tration of both Mg*"^ and ATP, as observed with intact Pgp, was not 
observed for the carboxyl-terminal NBD. Hiis suggests that some 
regulation of the ATPase activity of intact Pgp is lost when the 
NBD is expressed in the absence of the TMDs and the other NBD. 
These results are also consistent with the fact that ATPase activity 
of intact Pgp is stimulated in the presence of drugs. 

The NBDs of Pgp share considerable sequence similarity with 
the NBDs of CFTR. For example, there is 32% identity and 56% 
similarity between the carboxyl-terminal NBD of Pgp and CFTR 
nucleotide binding fold 1. However, the carboxyl-terminal do- 
main of Pgp shows rather weak binding to TNP-ATP, a fluores- 
cent analogue of ATP, unlike nucleotide binding fold 1 of CFTR 
(33). Such differences in ATP binding behavior of these domains 
possibly evidences subtle active-site structural differences. 

It has previously been observed that Pgp exists as a dimer or 
trimer in the plasma membrane (40, 41). The results from this 
study, showing that CTM-NBD, C-NBD, and C-MBP tend to 
form dimers, trimers, and higher oligomers, suggest that the 
carboxyl-terminal NBD may play an important role in the 



self-association of Pgp within the plasma membrane. One can 
speculate from these results that intact Pgp molecules undergo 
ordered self-association through interactions between NBDs. 
However, an NBD in the absence of TMDs and the other NBD 
may undergo less ordered self-association with the resulting 
tendency to form higher oligomers. 

The successful expression and purification of the carboxyl- 
terminal NBD of Pgp is a critical step in structure-function 
studies of the molecule. In combination with an anedogous 
approach to the expression and characterization of the amino- 
terminal NBD, we hope to improve understanding of the role of 
the NBDs in the function of ABC-transporters. 
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Expression and purification of tlie first nucleotide-binding domain and linicer 
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Many membrane proteins that belong to the ATP-binding 
cassette (ABC) superfamily are clinically important, including 
the cystic fibrosis transmembrane conductance regulator, the 
sulphonylurea receptor and P-glycoprotein (multidrug resistance 
gene product; MDRl). These proteins contain two multi- 
spanning transmembrane domains, each followed by one nu- 
cleotide-binding domain (NBD) and a linker region distal to the 
first NBD. ATP hydrolysis by the NBDs is critical for ABC 
protein function; the linker region seems to have a regulatory 
role. Previous attempts to express soluble NBDs and/or linker 
regions without detergent solubilization, or to purify NBDs at 
high yields as soluble fusion proteins, have been unsuccessful. 
Here we present a system for the expression in Escherichia coli of 
the first NBD of MDRl followed by its linker region (NBDl- 
MLD). A comparison of the expressions of NBDIMLD fused to 



INTRODUCTION 

Membrane proteins that belong to the ATP-binding cassette 
(ABC) superfamily span from bacteria to humans [1-3]. These 
proteins seem to be associated with transport of solutes (from 
ions to proteins) across biological membranes. Examples of ABC 
proteins include molecules of medical importance such as the 
cystic fibrosis transmembrane conductance regulator (CFTR) 
and P-glycoprotein (multidrug resistance gene product; MDRl). 
These proteins are of particular interest because (1) mutations in 
CFTR, a Cl~ channel, cause cystic fibrosis, (2) the sulphonylurea 
receptor is directly involved in the response to oral hypoglycaemic 
agents used to treat type II diabetes, and (3) the expression of 
MDRl in cancer cells confers multidrug resistance against 
chemotherapeutic agents [1-3]. One of the distinctive charac- 
teristics of ABC proteins is the presence of nucleotide-binding 
domains (NBDs) [1-3]. The proteins mentioned above contain 
two multispanning transmembrane domains, each followed by 
one NBD, and a linker region distal to the first NBD [1-3]. 

There is increasing interest in proteins of the ABC superfamily. 
In particular there is an interest in the NBDs, which are 
responsible for nucleotide binding and hydrolysis, and the linker 
region, which is involved in regulation of function and dimer- 
ization [1-6]. The linker region of CFTR is a target for phos- 



glutathione S-transferase, thioredoxin and maltose-binding pro- 
tein (MBP) shows that a high level of expression in the soluble 
fraction (approx. 8 % of total E. coli protein) can be achieved 
only for MBP-NBDIMLD. The addition of a proteolytic 
thrombin site just proximal to the N-terminal end of NBDIMLD 
allows the cleavage of NBDIMLD from MBP, which can be 
easily purified with retention of its ATPase activity. In summary, 
success was obtained only when using an MBP fusion protein 
vector containing a thrombin proteolytic site between MBP and 
NBDIMLD. The approach described here could be generally 
applicable to solving the problems of expression and purification 
of NBDs/linker regions of ABC proteins. 

Key words: cancer, cystic fibrosis, fusion proteins, multidrug 
resistance, P-glycoprotein. 



phorylation mediated by protein kinase A (PKA) and protein 
kinase C (PKC), and is known as the regulatory domain [2]. The 
smaller similar linker region of MDRl is known as minilinker 
domain (MLD; Figure lA). Similarly to the the regulatory do- 
main of CFTR, MDRl MLD is phosphorylated by PKA and 
PKC, and its phosphorylation seems to be involved in the 
regulation of MDRl [4,5]. Specifically, it has been claimed that 
MLD phosphorylation by PKA and PKC modulates swelling- 
activated Cl~ channels [4,6a]. 

In spite of recent advances in the knowledge of the structure 
and function of NBDs and linker regions with the use of peptides 
expressed in heterologous systems [2,3,6,7], the expression of 
ABC protein fragments in Escherichia coli has major problems. 
In fact, only detergent-extracted peptides and/or fusion proteins 
have been employed (i.e. fragments of interest not exposed to 
solubilizing agents have not been studied in isolation [8-13]). The 
availability of purified functional NBDs/hnker regions in suffi- 
cient amounts for biochemical and structural studies would be 
very useful in the elucidation of the bases of ATP binding and 
hydrolysis by NBDs of ABC proteins and the mechanisms of 
protein regulation by the linker regions. 

The aim of the present study was to identify an E. coli system 
for the expression of the N-terminal NBD (NBDl) of MDRl 
followed by the MLD (NBDIMLD), with the aim of developing 



Abbreviations used: ABC. ATP-binding cassette; CFTR, cystic fibrosis transmembrane conductance regulator; DTT, dithiothreitol; GST, glutathione 
S-transferase; IPTG, isopropyl /?-D-thiogalactoside; MBP, maltose-binding protein; MDRl. human multidrug resistance gene product, human P- 
glycoprotein; MLD. MDRl linker region, minilinker domain; NBD. nucleotide-binding domain; NBDl, N-termlnal nucleotide-binding domain; NBDIMLD, 
MDRl fragment containing NBDl and MLD; NEM. A/-ethylmalelmlde; NI-IDA. Ni^^-lmldodiacetlc acid; PKA, protein kinase A; PKC. protein kinase C; 
TRX. thioredoxin. 
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Figure 1 Fusion proteins expressed in f. co// 

(A) Schematic representation of MDR1 . Abbreviation: MSD, multiple spanning domain. The first 
(Glu^") and last (Pro^^^) residues of the NBDIMLD are labelled. (B) Schematic representation 
of the fusion proteins. Abbreviations: NBD2. C-terminal nucleotide-binding domain; FLAG. 8- 
residue FLAG epitope (Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys). Specific proteolytic sites are 
Indicated. 



a system that yields sufficient soluble protein that can be easily 
purified for biochemical and structural studies. We therefore 
compared the expression and purification of MDR 1 NBD 1 M LD 
fused to the C-terminus of glutathione S-transferase (GST), 
thioredoxin (TRX) and maltose-binding protein (MBP). 

EXPERIMENTAL 

f. coli strains and vectors 

In all the experiments shown, we used the coli strain BL21 
(lacking ompT 'dnd Ion proteases). We also used JM109, DH5a 
and TOP 10 strains in pilot experiments (see the Results section). 
The NBDIMLD fragment (280 amino acids; Asn^^-Pro®*' of 
MDRl) was amplified from MDRl cDNA (ATCC 65705) by 
PGR with the following primers: forward, 5'-CGGGATCCAA- 
CCTGAAGGTGCAGAGTGG-3'; reverse, 5 -CCG CTCGAG 
CTAAGGTATACTTTCATCCAGAG-3'. These primers con- 
tained Bam\\\ and Xhol sites (underlined) for cloning of the 
DNA fragment, in frame, into the fusion expression vectors. The 
expression vectors were pGEX-4t-3 (GST fusion vector; Phar- 
macia), pThioHis C (TRX fusion vector; InVitrogen) and pMal- 
c2 (MBP fusion vector; New England Biolabs). The GST, TRX 
and MBP fusion vectors contained thrombin, enierokinase and 
Factor Xa proteolytic sites respectively, to cleave the protein of 
interest from the fusion protein. The PGR fragment digested 
with BamWl and Xhol was cloned into pGEX-4t-3, pThioHis 
G and pMal-c2 by using the BamH\-Xho\, BglW-Sall and 
BamHl-Sall sites respectively. The expression plasmids were 
named pGEX-NBDlMLD, pTRX-NBDlMLD and pMBP- 
NBDIMLD. The NBDIMLD DNA was also cloned into a 
modified pMal-c2 vector to which we added a thrombin site 
(underlined) by using the following oligonucleolide adaptors: 



forward, 5 -AATTCGGTACC CTGGTTCCGCGT G-3^ : re- 
verse, 5 -GATCC ACGGGGAACCAG GGTACCG-3 . The 
annealed oligonucleotides, which contained a Kpnl site for easy 
primary screening, were ligated into the pMal-c2 vector cut with 
EcoRl and BamHl. The NBDIMLD DNA cut with BamHl and 
Xhol was cloned into the BamHl and Sail sites of the new 
expression vector (pMBPT); the resulting plasmid was named 
pMBPT-NBDlMLD. Finally, we also amplified NBDIMLD 
containing a FLAG epitope (Asp-Tyr- Lys-Asp- Asp-Asp- Asp- 
Lys) [14] at the C-terminus. We used the forward primer described 
above and the reverse primer 5'-GGGGTGGAGTGAGTTG- 
TGATGGTGGTCGTTGTAGTGAGGTATAGTTTGATGGA- 
GAGCGTC-3'. The PGR product was cut with BamHl and 
Xhol, then cloned into the same sites of pGEX-4t-3, to express 
NBDIMLD fused to GST at the N-terminus and a FLAG 
epitope at the G-terminus (pGEX-NBDlMLDFLAG). Se- 
quences of plasmid clones were confirmed by DNA sequencing. 
A schematic representation of all fusion proteins employed in 
the current studies is shown in Figure 1(B). 

Expression of recombinant NBD1MLD 

Unless stated otherwise, bacteria transformed with the expression 
vectors with or without NBDIMLD cDNA were grown in 
100 ml of Luria-Bertani medium with 0.1 mg/ml ampicillin and 
0.5% glucose to an A^^,^ of 0.9-1.0 at 37 °G. Induction was 
performed with 0.1 mM isopropyl /?-D-thiogalactoside (IPTG) 
at 37 ®C; bacteria were harvested 2 h later by centrifugation at 
4000 ^ for 15 min. The bacterial pellet was frozen at —80 ®C in 
the specific binding buffers for the different fusion proteins, as 
follows: for GST fusion proteins, 150 mM NaGl/20 mM sodium 
phosphate/0.5 mM dithiothreitol (DTT)/10mM EDTA (pH 
7.3); for GST/FLAG fusion protein, 50 mM Tris/HGl/150 mM 
NaGl/0.5mM DTT/lOmM EDTA (pH 7.4); for TRX fusion 
protein, 20 mM sodium phosphate/500 mM NaGl (pH 7.8); for 
MBP fusion proteins, 20 mM Tris/HGl/200 mM NaGl/1 mM 
EDTA (pH 7.4). PMSF from a stock in ethanol was added to the 
binding buffers at a concentration of 0.1-0.5 mM. The bacteria 
were thawed at 37 °G and lysed with a probe sonicator (three 
times at 150W for 10 s, with 8 min intervals in a solid-GO.^/ 
methanol bath). Part of the lysate was kept for SDS/PAGE 
analysis; the remainder was centrifuged at 265000 g for 15 min. 
The fusion proteins in the supernatant were then purified as 
described below. In some experiments the lysate was incubated 
with 1 % (v/v) Triton X-100, then mixed continuously on a 
rotary mixer for 30 min. The resulting supernatant was collected 
by centrifugation as described above. 

Purification of NBD1IVIL0 hybrid proteins 

Purification of GST fusion proteins was performed at room 
temperature. The soluble fractions containing GST fusion pro- 
teins were loaded on prepacked glutathione-Sepharose 4B 
columns (2 ml; Pharmacia), washed with 150 mM NaGl/20 mM 
sodium phosphate (pH 7.3) (10 column vol.) and eluted with 
10 ml of 50 mM Tris/HGl, pH 8.0; 10 fractions of 1 ml were 
collected. The soluble fraction containing NBDIMLD with 
a FLAG epitope was loaded into a I ml column packed 
with FLAG M2 affinity gel (Eastman Kodak). The column was 
washed with 10-15 vol. of binding buffer; the fusion protein 
was eluted with 10 ml of O.l M glycine, pH 3. Each 1 ml fraction 
collected was immediately neutralized with 20 /t\ of 1 M Tris 
base, pH 8.0. The soluble fraction containing the TRX fusion 
protein was loaded on a 3 ml column packed with a Ni^"*"- 
imidodiacetic acid (Ni-IDA) affinity gel (ProBond; InVitrogen). 
The column was washed with 8 ml of binding buflfer, then with 
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20 mM sodium phosphate/500 mM NaCl (pH 6.0), until the 
eluate A^^^ was less than 0.01. Then the elution proceeded with 
20 mM sodium phosphate/500 mM NaCl (pH 6.0) containing 
increasing concentrations of imidazole in sequential steps. The 
concentrations of imidazole were 50, 200, 350 and 500 mM, 5 ml 
each step; fractions of 1 ml were collected. All the steps for 
TRX-NBDIMLD purification were performed at 4 °C. Purifi- 
cation of MBP fusion proteins was also conducted at 4 ®C. The 
soluble fractions containing the MBP fusion proteins were loaded 
into a I cm X 10 cm column packed with amylose affinity gel 
(New England Biolabs) and washed with 10-15 column vol. of 
the binding buffer. Elution was performed with 15 ml of the same 
buffer containing 10 mM maltose. Fractions of 3 ml were col- 
lected. 

PurlficaUon of NBD1MLD 

Purified MBPT-NBDIMLD in the elution buffer was digested 
with thrombin (10 units/mg of fusion protein; Calbiochem, La 
Jolla, CA, U.S.A.) for 2 h at room temperature. At the end of the 
digestion period, 1 mM benzamidine was added to inactivate the 
thrombin. The reaction mix was desalted on a PD-10 column 
(Pharmacia) equilibrated with 20 mM Tris/HCl/100 mM 
NaCl/lOmM DTT (pH 7.0) and purified on a 3 ml DEAE- 
Sephacel column (Sigma, St. Louis, MO, U.S.A.). After washing 
with 15 vol. of the same buffer, elution was performed by 
lowering the buffer pH to 6.5 and increasing the NaCl con- 
centration to 0.5 M. The fractions containing NBDIMLD were 
pooled and desalted. NBDIMLD was finally purified by gel 
filtration after exchange of the buffer to 50 mM Tris/HCl/ 
150 mM NaCl/O.lmM EDTA/1 mM DTT (pH 7.4). The 
DEAE-purified sample was subjected to gel filtration on an 
FPLC system (Pharmacia). The sample was loaded on a 
1 cm X 30 cm column packed with Superdex-75 or Superdex-200 
(Pharmacia) pre-equilibrated with the sample buffer. Elution was 
at a flow rate of 0.4 ml/min (with the same buffer). Fractions 
of 0.8 ml were collected and analysed for the presence of 
NBDIMLD by staining with Coomassie Blue after PAGE. 

Determination of ATPase activity 

ATPase activity was calculated from P^ release measured color- 
imetrically [15]. Determinations were performed in 125 /*1 of 
50 mM Tris/HCl/0.1 mM EDTA/1 mM DTT (pH 8.0) in the 
presence of 5 mM ATP and 2.5 mM MgClg. Reactions were 
initiated by addition of the MgClj, and proceeded for 10 min at 
37 *'C. No ATP hydrolysis was detected in the absence of MgClg 
or protein ; P, release was constant during the reaction. 

Other techniques 

SDS/PAGE, Western-blotting analysis and protein determina- 
tions were performed with standard techniques, as previously 
described [16,17]. The C219 monoclonal antibody (Signet, 
Dedham, MA, U.S.A.) recognizes an epitope in MDRl NBD 
[18]. Densitometric analysis of Coomassie Blue-stained gels 
were performed with SigmaGel (Jandel). Peptide sequencing was 
performed by J. S. Smith (Protein Chemistry Laboratory, Uni- 
versity of Texas Medical Branch, Galveston, TX, U.S.A.). 

RESULTS AND DISCUSSION 

In spite of the recent advances in the knowledge of the structure 
and function of NBDs and linker regions expressed in hetero- 
logous systems, the expression of ABC protein fragments in E, 




Figure 2 Expression and purification of GST and TRX fusion proteins 

Analysis of expression and purification of fusion proteins by SDS/PAGE. Proteins were stained 
with Coomassie Slue. (A) Expression and purification of GST fusion proteins. Lanes show 

protein electrophoresis of cell lysate (Lys) from cells induced with IPTG (1) or uninduced (Ul), 
soluble fraction (Sol) without ( — 1) or with ( + T) Triton X-100 extraction, and purification by 
glutathione-Sepharose 4B (GST) and M2-qB\ affinity chromatography (FLAG). Results from 
BL21 cells transformed with pGEX-NBDlMLDFLAG are shown in all cases except in the lane 
labelled GST, for which cells were transformed with pGEX-NBDIMLD (no FLAG epitope at the 
C-terminus). The lane labelled WB corresponds to a Western blot of purified GST-NBD1MLD. 
(B) Expression and purification of TRX-NBD1MLD. Lane labelling is as described for (A) except 
for Ni-IDA, which denotes purification by agarose-Ni-IDA affinity chromatography. The lane 
labelled WB is an immunoblot of the Ni-IDA affinity purification. The protein seen in the Ni-IDA 
lane is not TRX-NBD1MLD, but a co-purified protein that was not recognized by the anti-(P- 
glycoprotein) monoclonal antibody C219 in immunoblots (the protein was present in elution 
fractions without NBD1MLD; results not shown). Soluble TRX-NBD1IVILD was never detected 
in gels stained with Coomassie Blue. The anti-(P-glycoproteln) antibody C219 was employed 
for all immunoblots. The equivalent of 0.3 ml of E coli growth was loaded in the Lys lanes. 
The Sol lanes contained 250 jug of protein. The amount of protein in lanes GST, FLAG and WB 
of (A) was 0.3 /tg. The amount of protein In the Ni-IDA and WB lanes of (8) was 0.5 /ig. The 
arrows indicate the position of the fusion proteins. 



co// has been problematic. First, the expression of NBDs of ABC 
proteins and the regulatory domain of CFTR is directed to 
inclusion bodies [8-1 1]. This problem has been solved in part by 
expression of the fragments as fusion proteins. However, some of 
the fusion peptides are still directed exclusively to inclusion 
bodies [8-11]. Secondly, on many occasions, fusion proteins are 
not recovered as soluble proteins in the absence of Triton X-100 
[8,12,13], and it is likely that detergent extraction disrupts NBD 
function [19]. Finally, the expression of the linker regions, either 
by themselves or following the N-terminal NBD (NBDl), is toxic 
and considerably decreases the expression in E. coli [20]. There- 
fore there are many aspects of the expression/purification of 
NBDs/linker regions of ABC proteins that need to be improved 
to obtain preparations suitable for detailed biochemical and 
structural studies. 

Because it is of interest to express the first NBD of ABC 
proteins with the linker region to establish whether this region 
regulates NBD function [21], we decided to identify an adequate 
expression system in £". coli suitable for this task. Because of our 
primary interest in MDRl, we wished to optimize the expression 
and purification of NBDIMLD, but our results could be 
applicable to equivalent fragments of other ABC proteins as well 
as to poorly soluble proteins in general. Our results also provide 
comparative information on the expression of GST, TRX and 
M BP fusion proteins that might be useful to researchers interested 
in protein expression in £. coli. 

Expression of NBDIMLD fused to GST or TRX as soluble proteins 
Is low 

Figure 2 shows that the GST, with or without (results not shown) 
a FLAG epitope, and the TRX fusion expression systems did not 
yield high levels of expression of soluble NBDIMLD fusion 
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Figure 3 Expression and purification of NBD1IVILD fused to IMBP 

(A) Expression and purilication of MBP-NBD1 MLD. Lane labelling is as described in the legend 
to Figure 2 except tor MBP, which refers to purification by amylose-affinity chromatography. 
The arrow indicates the position of the fusion protein monomer. The C219 antibody was used 
for the Western blot (WB lane). (B) Analysis of the purilication of NBD1MLD from the MBP 
fusion protein. Proteins were analysed by SOS/PAGE and stained with Coomassie Blue. Lane 
MBP. puritication of NBD1MLD fused to MBP by amylose-affinity chromatography; lane 
Thrombin, thrombin cleavage of NBD1 MLD from the fusion protein; lane DEAE. removal of MBP 
by DEAE anion-exchange chromatography; lane S-75. NBD1MLD purification by gel filtration 
on Superdex-75. The small arrows point to the MBP fusion protein and NBD1MLD. The 
equivalent of 0.3 ml of £ cofi grovirth was loaded in the Lys lanes. The Sol lane contained 
250 //g of protein. The amount of protein in lanes MBP and WB of (A) was 0.5 ^g. The amount 
of protein in the MBP, thrombin and S-75 lanes of (B) was 6-7 //g. The DEAE lane contained 
approx. 1 /tq of protein. The large arrow points to the NBD1MLD dimer subjected to 
electrophoresis after gel filtration. 



proteins. A FLAG epitope at the C-terminus of GST-NBDl- 
MLD can be used for the chromatographic separation of GST- 
NBDIMLD from GST or truncated proteins containing 
GST, without compromising the expression (compare GST and 
FLAG lanes in Figure 2A). The expression of soluble or Triton 
X-lOO-extracted GST fusion proteins, as well as that of soluble 
TRX-NBDIMLD, was not affected significantly by heat shock 
of the bacteria immediately before induction, the use of several 
different E. coli strains, lowering the temperature after IPTG 
induction, varying the IPTG concentration, induction at lower 
bacterial density, or increasing the induction time to 3.S h (results 
not shown). Our results are consistent with the observations that 
expression of NBDs as fusion proteins with polyhistidine or GST 
did not improve solubility (see Figure 2A) [8-12]. Although we 
do not know the reasons for the low expression of soluble 
NBD/linker-region-containing proteins, it might be related to 
their hydrophobicity. It is known that the overexpression of 
membrane proteins is difficult [24] and that NBDs of CFTR and 
histidine permease do solubilize in lipid membranes [25-28]. 

Expression of NBD1MLD fused to MBP as a soluble protein is 
significant, and functional NBD1IVILD can be purified from the 
fusion protein by using a vector witti an added tlirombin site 

Figure 3(A) illustrates the expression of MBP-NBDIMLD in 
BL21 cells. Although more than 70% of the fusion protein was 
directed to inclusion bodies, a significant amount (approx. 8 % 
of E. coli protein) was expressed as soluble protein (Figure 3A). 
Moreover, the fusion protein could be purified in a single step on 
the basis of its binding to amylose (Figure 3 A). Formation of the 
high-molecular-mass aggregates of MBP-NBDIMLD seen in 
Figure 3(A) could be prevented by 1 mM DTT (compare the 
MBP lanes in Figures 3A and 3B), suggesting a role of disulphide 
bonds from the Cys in motif A of NBD (the only Cys in 
MBP-NBDIMLD; Cys^»» of MDRl). 

Unfortunately, cleavage of NBD 1 MLD from the fusion pro- 



Table 1 Purification and ATPase activity of MBP-NBDIMLD and NBD1MLD 

Total protein refers to the protein amount/litre of bacterial growtfi. Approximate purity was 
calculated from densitometry of Coomassie Blue-stained gels. Tfie ATPase activity is tfie 
mean±S.E.M. and tfie number of determinations is sfiown in parentfieses. Abbreviation: n.d., 
not determined. See the Experimental section for details. 



Preparation 


Total protein 
(mg/l of 
culture) 


ATPase activity 
(nmol of 
P,/min per mg) 


Purity (%) 


Soluble fraction 


580 


n.d. 


12 


Amylose affinity 


70 


57 ± 2 (5) 


> 95 


DEAE chromatography 


18 


92 ± 2 (3) 


> 80 


Gel filtration 


3.5 


144 ±4 (3) 


>95 



tein using the Factor Xa site could not be optimized, because we 
always observed the proteolysis of NBD 1 MLD. Inefficient and 
non-specific digestion is a common observation when using 
Factor Xa [30-32]. These results are in agreement with studies 
showing the high-level expression of NBDs of ABC proteins 
fused to MBP, which could not be cleaved off efficiently with 
Factor Xa [30,31]. Because we could obtain adequate digestion 
of NBDIMLD from GST-NBDIMLD by using thrombin, we 
modified the commercial MBP fusion protein vector pMal-c2 by 
adding a thrombin site. The resulting fusion protein containing 
the added thrombin site between MBP and NBDIMLD (MB- 
PT-NBDIMLD) was expressed at a level undistinguishable 
from that of MBP-NBDIMLD. Furthermore NBDIMLD could 
be cleaved off from MBP and purified by a combination of 
anion-exchange and gel-filtration chromatographies (see Figure 
3B). MBP did not bind to DEAE and could be removed easily 
from the thrombin digestion reaction. The DEAE-purified ma- 
terial was approx. 80% NBDIMLD [two proteins more than 
30 kDa, barely seen in the DEAE lane of Figure 3(B), were 
enriched]. A highly purified (more than 95 %) NBDIMLD could 
be collected in the void volume of the Superdex-75 gel-filtration 
column (also in the void volume when Superdex-200 was used). 
Formation of aggregates is a common feature of NBD peptides 
[30]. 

The identity of MBP-NBDIMLD was confirmed by the 
presence of MBP (the fusion protein can be purified on an 
amylose column; see Figure 3), the apparent molecular mass of 
the fusion protein (approx. 78 kDa; Figure 3) and the thrombin 
cleavage products (approx. 42 and 36 kDa; Figure 3B), the 
presence of the C219 epitope (near the end of NBDl ; see Figure 
3A), and the N-terminal sequence (Gly-Ser-Asn-Leu-Lys) 
of NBDIMLD after the digestion of MBP-NBDIMLD by 
thrombin. 

Finally, we measured the ATPase activities of purified MB- 
PT-NBDIMLD, DEAE-purified NBDIMLD and purified 
NBD 1 MLD. All peptides had ATPase activities comparable with 
those of other NBDs of ABC proteins. The ATPase activity of 
MBP-NBDIMLD was inhibited by 47±3% (mean±S.E.M., 
/I = 4) by 500 /*M of //-ethylmaleimide (NEM). NEM is a thiol 
reagent known to block P-glycoprotein ATPase activity [33]. 
Interestingly, the inhibition of the ATPase activity by NEM was 
abolished when 5 mM ATP was present before the addition of 
NEM (4±2% inhibition as compared with control; n = 3). 
Protection by ATP against NEM inhibition has been described 
for full-length P-glycoprotein [33]. These results indicate that the 
NBDIMLD is functional and that the anion-exchange and gel- 
filtration purification procedures do not decrease tlie ATPase 
activity of NBDIMLD (see Table 1). 
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Summary 

Comparison of NBDIMLD expression as fusion proteins with 
GST, with TRX and with MBP shows that a high level of 
expression in the soluble fraction (approx. 8% of total E. coli 
protein) can be achieved only for MBP-NBDIMLD (see Table 
1). The addition of a proteolytic thrombin site just proximal to 
the N-terminal end of NBDIMLD permits the cleavage of 
NBDIMLD from MBP. NBDIMLD can be purified easily by 
ion-exchange and gel-filtration chromatographies with retention 
of its ATPase activity. The approach described here could be 
generally applicable to solving the problems of expression and 
purification of NBDs/linker regions of ABC proteins. 
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